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METHODS FOR SCREENING FOR TRANSDOMINANT INTRACELLULAR 
EFFECTOR PEPTIDES AND RNA MOLECULES 



FIELD OF THE INVENTION 
The technical field of this invention is methods for screening for transdominant effector 
peptides and RNA molecules selected inside living cells from randomized pools. 

BACKGROUND OF THE INVENTION 

Signaling pathways in cells often begin with an effector stimulus that leads to a 
phenotypically describable change in cellular physiology. Despite the key role 
intracellular signaling pathways play in disease pathogenesis, in most cases, little is 
understood about a signaling pathway other than the initial stimulus and the ultimate 
cellular response. 

Historically, signal transduction has been analyzed by biochemistry or genetics. The 
biochemical approach dissects a pathway in a "stepping-stone" fashion: find a 
molecule that acts at, or is involved in, one end of the pathway, isolate assayable 
quantities and then try to determine the next molecule in the pathway, either upstream 
or downstream of the isolated one. The genetic approach is classically a "shot in the 
dark": induce or derive mutants in a signaling pathway and map the locus by genetic 
crosses or complement the mutation with a cDNA library. Limitations of biochemical 
approaches include a reliance on a significant amount of pre-existing knowledge about 
the constituents under study and the need to carry such studies out in vitro, 
post-mortem. Limitations of purely genetic approaches include the need to first derive 
and then characterize the pathway before proceeding with identifying and cloning the 
gene. 
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Screening molecular libraries of chemical compounds for drugs that regulate signal 
systems has led to important discoveries of great clinical significance. Cyclosporin A 
(CsA) and FK506, for examples, were selected in standard pharmaceutical screens for 
inhibition of T-cell activation. It is noteworthy that while these two drugs bind 
5 completely different cellular proteins - cyciophilin and FK506 binding protein (FKBP), 

respectively, the effect of either drug is virtually the same » profound and specific 
suppression of T-cell activation, phenotypically observable in T cells as inhibition of 
mRNA production dependent on transcription factors such as NF-AT and NF-kB. 
Libraries of small peptides have also been successfully screened in vitro in assays for 
1 0 bioactivity . The literature is replete with examples of small peptides capable of 

modulating a wide variety of signaling pathways. For example, a peptide derived from 
the HIV-1 envelope protein has been shown to block the action of cellular calmodulin. 



A major limitation of conventional in vitro screens is delivery. While only minute 
amounts of an agent may be necessary to modulate a particular cellular response, 
15 delivering such an amount to the requisite subcellular location necessitates exposing 

the target cell or system to relatively massive concentrations of the agent. The effect 
of such concentrations may well mask or preclude the targeted response. 



Thus, it is an object of the present invention to provide methods and compositions for 
the effective introduction of random libraries into cells to screen for bioactive 
20 compounds. 



Relevant Literature 



Mann et al. (1983) Cell 33, 153-159, Pear et al. (1993) Proc. Natl. Acad. Sci. USA 
90(18):8392-6 and WO 94/19478 describe the BOSC and BING retroviral systems 
useful as delivery vectors for the disclosed methods. 

2 5 Scott and Craig (1 994) Current Opinion in Biotechnology 5:40-48 review random 

peptide libraries. Hupp et al. (1995) describe small peptides which activate the latent 
sequence-specific DNA binding function of p53. Palzkill et al. (1 994) report the 
selection of functional signal cleavage sites from a library of random sequences 
introduced into TEM-1 -lactamase. 
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SUMMARY OF THE INVENTION 

The invention provides methods and compositions for screening for transdominant 
bioactive agents such as pharmaceuticals. The invention accesses molecules or 
targets within living cells and provides for the direct selection of those bioactive agents 
with desired phenotypic effects. 

In one aspect of the invention, methods for screening for a transdominant bioactive 
agent capable of altering the phenotype of a cell are provided. The methods comprise 
the steps of a) introducing a molecular library of randomized candidate nucleic acids 
into a plurality of cells, wherein each of said nucleic acids comprises a different 
nucleotide sequence; b) screening the plurality of cells for a cell exhibiting an altered 
phenotype, wherein the altered phenotype is due to the presence of a transdominant 
bioactive agent. The methods may also include the steps of c) isolating the cell(s) 
exhibiting an altered phenotype, d) isolating a candidate nucleic acid from the cell(s). 

The invention further provides methods for isolating a target molecule using either a 
candidate nucleic acid or the expression product of a candidate nucleic acid. 

in an additional aspect, the candidate nucleic acids of the invention are linked to fusion 
partners. 

In a further aspect, the invention provides methods for screening for a transdominant 
bioactive agent capable of altering the phenotype of a cell. The methods comprises 
the steps of a) introducing a molecular library of randomized candidate nucleic acids 
into a first plurality of cells, wherein each of the nucleic acids comprises a different 
nucleotide sequence; b) contacting the first plurality of cells with a second plurality of 
cells; and c) screening the second plurality of cells for a cell exhibiting an altered 
phenotype. 

In an additional aspect, the present invention provides molecular libraries of 
retroviruses comprising different randomized nucleic acids, and cellular libraries 
containing the retroviral libraries. 



BRIEF DESCRIPTION OF THE FIGURES 



WO 97/27212 



-4- 



PCT/US97/01019 



Figure 1. Creation of a library of random peptides in a retrovirus DNA construct by 
PCR. 



Figure 2. Creation of a library of random peptides in a retrovirus DNA construct by 
primed DNA synthesis. 

5 Figure 3. Presentation constructs for localizing presentation structures to specific 

cellular locales. 



Figure 4. Schematic of a retroviral construct. 



DETAILED DESCRIPTION OF THE INVENTION 



The present invention provides methods and compositions to create, effectively 
10 introduce into cells and screen compounds that affect a signaling pathway. Little or no 

knowledge of the pathway is required, other than a presumed signaling event and an 
observable physiologic change in the target cell. The disclosed methods are 
conceptually distinct from prior library search methods in that it is an in vivo stratagem 
for accessing intracellular signaling mechanisms. The invention also provides for the 
15 isolation of the constituents of the pathway, the tools to characterize the pathway, and 

lead compounds for pharmaceutical development. 



The present invention provides methods for the screening of candidate bioactive 
agents which are capable of altering the phenotype of cells containing the agents. The 
methods of the present invention provide a significant improvement over conventional 

2 0 screening techniques, as they allow the rapid screening of large numbers of random 

oligonucleotides and their corresponding expression products in a single, in vivo step. 
Thus, by delivering the random oligonucleotides to cells and screening the same cells, 
without the need to collect or synthesize in vitro the candidate agents, highly efficient 
screening is accomplished. In addition, the present methods allow screening in the 

2 5 absence of significant prior characterization of the cellular defect per se. 

Thus, the present invention provides methods for screening candidate bioactive agents 
for a transdominant bioactive agent capable of altering the phenotype of a cell. 
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By "candidate bioactive agents" or "candidate drugs" or "candidate expression 
products" or grammatical equivalents herein is meant the expression product of a 
candidate nucleic acid which may be tested for the ability to transdominantly alter the 
phenotype of a cell. As is described below, the candidate bioactive agents are the 
5 expression products of candidate nucleic acids, and encompass several chemical 

classes, including peptides and nucleic acids such as DNA, messenger RNA (mRNA), 
antisense RNA, ribozyme components, etc. Thus, the candidate bioactive agents 
(expression products) may be either translation products of the candidate nucleic 
acids, i.e. peptides, or transcription products of the candidate nucleic acids, i.e. either 
10 DNA or RNA. 



In a preferred embodiment, the candidate bioactive agents are translation products of 
the candidate nucleic acids. In this embodiment, the candidate nucleic acids are 
introduced into the cells, and the cells express the nucleic acids to form peptides. 
Thus, in this embodiment, the candidate bioactive agents are peptides. Generally, 
15 peptides ranging from about 4 amino acids in length to about 100 amino acids may be 

used, with peptides ranging from about 5 to about 50 being preferred, with from about 
5 to about 30 being particularly preferred and from about 6 to about 20 being especially 
preferred. 



In a preferred embodiment, the candidate bioactive agents are transcription products 
2 0 of the candidate nucleic acids, and are thus also nucleic acids. The transcription 

products may be either primary transcripts or secondary translation products. That is, 
using the retroviral reverse transcriptase, primary DNA is made which are later 
converted into double stranded DNA. Additionally, using the primary DNA, RNA 
transcripts can be generated within the cell, including mRNA, antisense RNA and 

2 5 ribozymes or portions thereof. 

At a minimum, the candidate bioactive agents comprise randomized expression 
products of the candidate nucleic acids. That is, every candidate bioactive agent has a 
randomized portion, as defined below, that is the basis of the screening methods 
outlined herein. In addition, to the randomized portion, the candidate bioactive agent 

3 0 may also include a fusion partner. 
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In a preferred embodiment the candidate bioactive agents are linked to a fusion 
partner. By "fusion partner" or "functional group" herein is meant a sequence that is 
associated with the candidate bioactive agent, that confers upon all members of the 
library in that class a common function or ability. Fusion partners can be heterologous 
5 (i.e. not native to the host cell), or synthetic (not native to any cell). Suitable fusion 

partners include, but are not limited to: a) presentation structures, as defined below, 
which provide the candidate bioactive agents in a conformationally restricted or stable 
form; b) targeting sequences, defined below, which allow the localization of the 
candidate bioactive agent into a subcellular or extracellular compartment; c) rescue 

10 sequences as defined below, which allow the purification or isolation of either the 

candidate bioactive agents or the nucleic acids encoding them; d) stability sequences, 
which confer stability or protection from degradation to the candidate bioactive agent or 
the nucleic acid encoding it, for example resistance to proteolytic degradation; e) 
dimerization sequences, to allow for peptide dimerization; or f) any combination of a), 

15 b), c), d), and e), as well as linker sequences as needed. 

In a preferred embodiment, the fusion partner is a presentation structure. By 
"presentation structure" or grammatical equivalents herein is meant a sequence, 
which, when fused to candidate bioactive agents, causes the candidate agents to 
assume a conformationally restricted form. Proteins interact with each other largely 

2 0 through conformationally constrained domains. Although small peptides with freely 

rotating amino and carboxyl termini can have potent functions as is known in the art, 
the conversion of such peptide structures into pharmacologic agents is difficult due to 
the inability to predict side-chain positions for peptidomimetic synthesis. Therefore the 
presentation of peptides in conformationally constrained structures will benefit both the 
25 later generation of pharmaceuticals and will also likely lead to higher affinity 

interactions of the peptide with the target protein. This fact has been recognized in the 
combinatorial library generation systems using biologically generated short peptides in 
bacterial phage systems. A number of workers have constructed small domain 
molecules in which one might present randomized peptide structures. 

3 0 While the candidate bioactive agents may be either nucleic acid or peptides, 

presentation structures are preferably used with peptide candidate agents. Thus, 
synthetic presentation structures, i.e. artificial polypeptides, are capable of presenting 
a randomized peptide as a conformationally-restricted domain. Generally such 
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presentation structures comprise a first portion joined to the N-terminal end of the 
randomized peptide, and a second portion joined to the C-terminal end of the peptide; 
that is, the peptide is inserted into the presentation structure, although variations may 
be made, as outlined below. To increase the functional isolation of the randomized 
5 expression product, the presentation structures are selected or designed to have 

minimal biologically activity when expressed in the target cell. 

Preferred presentation structures maximize accessibility to the peptide by presenting it 
on an exterior loop. Accordingly, suitable presentation structures include, but are not 
limited to, minibody structures, loops on beta-sheet turns and coiled-coil stem 
1 0 structures in which residues not critical to structure are randomized, zinc-finger 

domains, cysteine-linked (disulfide) structures, transglutaminase linked structures, 
cyclic peptides, B-loop structures, helical barrels or bundles, leucine zipper motifs, etc. 

In a preferred embodiment, the presentation structure is a coiled-coil structure, 
allowing the presentation of the randomized peptide on an exterior loop. See, for 
15 example, Myszka et al„ Biochem. 33:2362-2373 (1994), hereby incorporated by 

reference, and Figure 3). Using this system investigators have isolated peptides 
capable of high affinity interaction with the appropriate target. In general, coiled-coil 
structures allow for between 6 to 20 randomized positions. 

A preferred coiled-coil presentation structure is as follows: 
2 0 MGC AALESEVSALESEVASLjSEVAAL GRGPMP LAAVKSKLSAVKSKLASVKSKLAA 
CGPP. The underlined regions represent a coiled-coil leucine zipper region defined 
previously (see Martin et aL, EMBO J. 13(22):5303-5309 (1994), incorporated by 
reference). The bolded GRGDMP region represents the loop structure and when 
appropriately replaced with randomized peptides (Le.candidate bioactive agents, 

2 5 generally depicted herein as (X) nt where X is an amino acid residue and n is an integer 

of at least 5 or 6) can be of variable length. The replacement of the bolded region is 
facilitated by encoding restriction endonuclease sites in the underlined regions, which 
allows the direct incorporation of randomized oligonucleotides at these positions. For 
example, a preferred embodiment generates a Xhol site at the double underlined LE 

3 0 site and a Hindlll site at the double-underlined KL site. 
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In a preferred embodiment the presentation structure is a minibody structure. A 
"minibody" is essentially composed of a minimal antibody complementarity region. The 
minibody presentation structure generally provides two randomizing regions that in the 
folded protein are presented along a single face of the tertiary structure. See for 
5 example Bianchi et al., J. Mol. Biol. 236(2):649-59 (1994), and references cited therein, 

all of which are incorporated by reference). Investigators have shown this minimal 
domain is stable in solution and have used phage selection systems in combinatorial 
libraries to select minibodies with peptide regions exhibiting high affinity, Kd = 10' 7 , for 
the pro-inflammatory cytokine IL-6. 

10 A preferred minibody presentation structure is as follows: 

MGRNSQATSfiQFSIjFYME 

QSILYLQKKKGPP. The bold, underline regions are the regions which may be 
randomized. The italized phenylalanine must be invariant in the first randomizing 
region. The entire peptide is cloned in a three-oligonucleotide variation of the coiled- 
15 coil embodiment, thus allowing two different randomizing regions to be incorporated 

simultaneously. This embodiment utilizes non-palindromic BstXI sites on the termini. 

In a preferred embodiment, the presentation structure is a sequence that contains 
generally two cysteine residues, such that a disulfide bond may be formed, resulting in 
a conformationally constrained sequence. This embodiment is particularly preferred 
2 0 when secretory targeting sequences are used. As will be appreciated by those in the 

art, any number of random sequences, with or without spacer or linking sequences, 
may be flanked with cysteine residues. In other embodiments, effective presentation 
structures may be generated by the random regions themselves. For example, the 
random regions may be "doped" with cysteine residues which, under the appropriate 

2 5 redox conditions, may result in highly crosslinked structured conformations, similar to a 

presentation structure. Similarly, the randomization regions may be controlled to 
contain a certain number of residues to confer li-sheet or a-helical structures. 

In a preferred embodiment, the fusion partner is a targeting sequence. As will be 
appreciated by those in the art, the localization of proteins within a cell is a simple 

3 0 method for increasing effective concentration and determining function. For example, 

RAF1 when localized to the mitochondrial membrane can inhibit the anti-apoptotic 
effect of BCL-2. Similarly, membrane bound Sos induces Ras mediated signaling in T- 
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lymphocytes. These mechanisms are thought to rely on the principle of limiting the 
search space for ligands, that is to say, the localization of a protein to the plasma 
membrane limits the search for its ligand to that limited dimensional space near the 
membrane as opposed to the three dimensional space of the cytoplasm. Alternatively, 
5 the concentration of a protein can also be simply increased by nature of the 

localization. Shuttling the proteins into the nucleus confines them to a smaller space 
thereby increasing concentration. Finally, the ligand or target may simply be localized 
to a specific compartment, and inhibitors must be localized appropriately. 

Thus, suitable targeting sequences include, but are not limited to, binding sequences 
capable of causing binding of the expression product to a predetermined molecule or 
class of molecules while retaining bioactivity of the expression product, (for example by 
using enzyme inhibitor or substrate sequences to target a class of relevant enzymes); 
sequences signalling selective degradation, of itself or co-bound proteins; and signal 
sequences capable of constitutively localizing the candidate expression products to a 
predetermined cellular locale, including a) subcellular locations such as the Golgi, 
endoplasmic reticulum, nucleus, nucleoli, nuclear membrane, mitochondria, 
chloroplast, secretory vesicles, lysosome, and cellular membrane; and b) extracellular 
locations via a secretory signal. Particularly preferred is localization to either 
subcellular locations or to the outside of the cell via secretion. 

2 0 In a preferred embodiment, the targeting sequence is a nuclear localization signal 

(NILS). NLSs are generally short, positively charged (basic) domains that serve to 
direct the entire protein in which they occur to the cell's nucleus. Numerous NLS 
amino acid sequences have been reported including single basic NLS's such as that of 
the SV40 (monkey virus) large T Antigen (Pro Lys Lys Lys Arg Lys Val), Kalderon 
25 (1984), et al., Cell, 39:499-509; the human retinoic acid receptor-G nuclear localization 

signal (ARRRRP); NFkB p50 (EEVQRKRQKL; Ghosh et ah, Cell 62:1019 (1990); 
NFkB p65 (EEKRKRTYE; Nolan et al., Cell 64:961 (1991); and others (see for 
example Boulikas, J. Cell. Biochem. 55(1):32-58 (1994), hereby incorporated by 
reference) and double basic NLS's exemplified by that of the Xenopus (African clawed 

3 0 toad) protein, nucleoplasms (Ala Val Lys Arg Pro Ala Ala Thr Lys Lys Ala Gly Gin Ala 

Lys Lys Lys Lys Leu Asp), Dingwall, et al., Cell, 30:449-458, 1982 and Dingwall, et al., 
J. Cell Biol., 107:641-849; 1988). Numerous localization studies have demonstrated 
that NLSs incorporated in synthetic peptides or grafted onto reporter proteins not 



10 



15 
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normally targeted to the cell nucleus cause these peptides and reporter proteins to be 
concentrated in the nucleus. See, for example, Dingwall, and Laskey, Ann, Rev. Cell 
Biol., 2:367-390, 1986; Bonnerot, etai., Proc. Natl. Acad. Sci. USA, 84:6795-6799, 
1987; Galileo, et aL, Proc. Natl. Acad. Sci. USA, 87:458-462, 1990. 

5 In a preferred embodiment, the targeting sequence is a membrane anchoring signal 

sequence. This is particularly useful since many parasites and pathogens bind to the 
membrane, in addition to the fact that many intracellular events originate at the plasma 
membrane. Thus, membrane-bound peptide libraries are useful for both the 
identification of important elements in these processes as well as for the discovery of 

10 effective inhibitors. The invention provides methods for presenting the randomized 

expression product extracellularly or in the cytoplasmic space; see Fig 3. For 
extracellular presentation, a membrane anchoring region is provided at the carboxyl 
terminus of the peptide presentation structure. The randomized epression product 
region is expressed on the cell surface and presented to the extracellular space, such 

1 5 that it can bind to other surface molecules (affecting their function) or molecules 

present in the extracellular medium. The binding of such molecules could confer 
function on the cells expressing a peptide that binds the molecule. The cytoplasmic 
region could be neutral or could contain a domain that, when the extracellular 
randomized expression product region is bound, confers a function on the cells 

2 0 (activation of a kinase, phosphatase, binding of other cellular components to effect 

function). Similarly, the randomized expression product-containing region could be 
contained within a cytoplasmic region, and the transmembrane region and extracellular 
region remain constant or have a defined function. 

Membrane-anchoring sequences are well known in the art and are based on the 

2 5 genetic geometry of mammalian transmembrane molecules. Peptides are inserted into 

the membrane based on a signal sequence (designated herein as ssTM) and require a 
hydrophobic transmembrane domain (herein TM). The transmembrane proteins are 
inserted into the membrane such that the regions encoded 5' of the transmembrane 
domain are extracellular and the sequences 3' become intracellular. Of course, if 

3 0 these transmembrane domains are placed 5' of the variable region, they will serve to 

anchor it as an intracellular domain, which may be desirable in some embodiments. 
ssTMs and TMs are known for a wide variety of membrane bound proteins, and these 
sequences may be used accordingly, either as pairs from a particular protein or with 
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each component being taken from a different protein, or alternatively, the sequences 
may be synthetic, and derived entirely from consensus as artificial delivery domains. 

As will be appreciated by those in the art, membrane-anchoring sequences, including 
both ssTM and TM, are known for a wide variety of proteins and any of these may be 
used. Particularly preferred membrane-anchoring sequences include, but are not 
limited to, those derived from CD8, ICAM-2, 1L-8R, CD4 and LFA-1. 

Useful sequences include sequences from: 1) class I integral membrane proteins such 
as IL-2 receptor beta-chain (residues 1-26 are the signal sequence, 241-265 are the 
transmembrane residues; see Hatakeyama et aL, Science 244:551 (1989) and von 
Heijne et al, Eur. J. Biochem. 174:671 (1988)) and insulin receptor beta chain 
(residues 1-27 are the signal, 957-959 are the transmembrane domain and 960-1382 
are the cytoplasmic domain; see Hatakeyama, supra, and Ebina et aL, Cell 40:747 
(1985)); 2) class II integral membrane proteins such as neutral endopeptidase 
(residues 29-51 are the transmembrane domain, 2-28 are the cytoplasmic domain; see 
Matfroy et aL, Biochem. Biophys. Res. Commun. 144:59 (1987)); 3) type ill proteins 
such as human cytochrome P450 NF25 (Hatakeyama, supra); and 4) type IV proteins 
such as human P-glycoprotein (Hatakeyama, supra). Particularly preferred are CD8 
and ICAM-2. For example, the signal sequences from CD8 and ICAM-2 lie at the 
extreme 5' end of the transcript. These consist of the amino acids 1-32 in the case of 
CD8 (MASPLTRFLSLNLLLLGESILGSGEAKPQAP; Nakauchi et al., PNAS USA 
82:5126 (1985) and 1-21 in the case of ICAM-2 (MSSFGYRTLTVALFTLICCPG; 
Staunton et al., Nature (London) 339:61 (1989)). These leader sequences deliver the 
construct to the membrane while the hydrophobic transmembrane domains, placed 3* 
of the random candidate region, serve to anchor the construct in the membrane. 
These transmembrane domains are encompassed by amino acids 145-195 from CD8 
(PQRPEDCRPRGSVKGTGLDFACDIYIWAPLAGICVALLLSLIITLICYHSR; Nakauchi, 
supra) and 224-256 from ICAM-2 (MVHVTWSVLLSLFVTSVLLCFIFGQHLRQQR; 
Staunton, supra). 

Alternatively, membrane anchoring sequences include the GPi anchor, which results in 
a covalent bond between the molecule and the lipid bilayer via a glycosyl- 
phosphatidylinositol bond for example in DAF 

(PNKGSGTTSGTTRLLSGHTCFTLTGLLGTLVTMGLLT, with the bolded serine the 
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site of the anchor; see Homans et al. ( Nature 333(61 70):269-72 (1988), and Moran et 
al. f J. Biol. Chem. 266:1250 (1991)). In order to do this, the GPI sequence from Thy-1 
can be cassetted 3' of the variable region in place of a transmembrane sequence. 

Similarly, myristylation sequences can serve as membrane anchoring sequences. It is 
5 known that the myristylation of c-src recruits it to the plasma membrane. This is a 

simple and effective method of membrane localization, given that the first 14 amino 
acids of the protein are solely responsible for this function: MGSSKSKPKDPSQR (see 
Cross et al., Mol. Cell. Biol. 4(9):1834 (1984); Spencer et al., Science 262:1019-1024 
(1993), both of which are hereby incorporated by reference). This motif has already 

1 0 been shown to be effective in the localization of reporter genes and can be used to 

anchor the zeta chain of the TCR. This motif is placed 5' of the variable region in order 
to localize the construct to the plasma membrane. Other modifications such as 
palmitoylation can be used to anchor constructs in the plasma membrane; for example, 
palmitoylation sequences from the G protein-coupled receptor kinase GRK6 sequence 

15 (LLQRLFSRQDCCGNCSDSEEELPTRL, with the bold cysteines being palmitolyated; 

Stoffel et al., J. Biol. Chem 269:27791 (1994)); from rhodopsin 
(KQFRNCMLTSLCCGKNPLGD; Barnstable et al., J. Mol. Neurosci. 5(3):207 (1994)); 
and the p21 H-ras 1 protein (LNPPDESGPGCMSCKCVLS; Capon etal., Nature 
302:33 (1983)). 

20 In a preferred embodiment, the targeting sequence is a lysozomal targeting sequence, 

including, for example, a lysosomal degradation sequence such as Lamp-2 (KFERQ; 
Dice, Ann. N.Y. Acad. Sci. 674:58 (1992); or lysosomal membrane sequences from 
Lamp-1 (MUPIAGFFALAGL VLIVLIA VL/fiRKBSHASYQII, Uthayakumar et al., Cell. 
Mol. Biol. Res. 41:405 (1995)) or Lamp-2 

2 5 (/ VPtAVGAAl AGVl /LVLL^ VF/G LKHH HAG YEQF . Konecki et la., Biochem. Biophys. 

Res. Comm. 205:1-5 (1994), both of which show the transmembrane domains in italics 
and the cytoplasmic targeting signal underlined). 

Alternatively, the targeting sequence may be a mitochondrial localization sequence, 
including mitochondrial matrix sequences (e.g. yeast alcohol dehydrogenase III; 

3 0 MLRTSSLFTRRVQPSLFSRNILRLQST; Schatz, Eur. J. Biochem. 165:1-6 (1987)); 

mitochondrial inner membrane sequences (yeast cytochrome c oxidase subunit IV; 
MLSLRQSIRFFKPATRTLCSSRYLL; Schatz, supra); mitochondrial intermembrane 
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space sequences (yeast cytochrome d; 

MFSMLSKRWAQRTLSKSFYSTATGAASKSGKLTQKLVTAGVAAAGITASTLLYADSLT 
AEAMTA; Schatz, supra) or mitochondrial outer membrane sequences (yeast 70 kD 
outer membrane protein; 

MKSFITRNKTAILATVAATGTAIGAYYYYNQLQQQQQRGKK; Schatz, supra). 

The target sequences may also be endoplasmic reticulum sequences, including the 
sequences from calreticulin (KDEL; Pelham, Royal Society London Transactions B; 1- 
10 (1992)) or adenovirus E3/19K protein (LYLSRRSFIDEKKMP; Jackson et al., EMBO 
J. 9:3153(1990). 

Furthermore, targeting sequences also include peroxisome sequences (for example, 
the peroxisome matrix sequence from Luciferase; SKL; Keller et al., PNAS USA 
4:3264 (1987)); farnesylation sequences (for example, P21 H-ras 1; 
LNPPDESGPGCMSCKCVLS, with the bold cysteine farnesylated; Capon, supra); 
geranylgeranylation sequences (for example, protein rab-5A; LTEPTQPTRNQCCSN. 
with the bold cysteines geranylgeranylated; Farnsworth, PNAS USA 91:11963 (1994)); 
or destruction sequences (cyclin B1; RTALGDIGN; Klotzbucher et al., EMBO J. 1:3063 
(1996)). 

In a preferred embodiment, the targeting sequence is a secretory signal sequence 
capable of effecting the secretion of the candidate translation product. There are a 
large number of known secretory signal sequences which are placed 5* to the variable 
peptide region, and are cleaved from the peptide region to effect secretion into the 
extracellular space. Secretory signal sequences and their transferability to unrelated 
proteins are well known, e.g., Silhavy, et al. (1985) Microbiol. Rev. 49, 398-418. This 
is particularly useful to generate a peptide capable of binding to the surface of, or 
affecting the physiology of, a target cell that is other than the host cell, e.g., the cell 
infected with the retrovirus. In a preferred approach, a fusion product is configured to 
contain, in series, secretion signal peptide-presentation structure-randomized 
expression product region-presentation structure, see Figure 3. In this manner, target 
cells grown in the vicinity of cells caused to express the library of peptides, are bathed 
in secreted peptide. Target cells exhibiting a physiological change in response to the 
presence of a peptide, e.g., by the peptide binding to a surface receptor or by being 
internalized and binding to intracellular targets, and the secreting cells are localized by 
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any of a variety of selection schemes and the peptide causing the effect determined. 
Exemplary effects include variously that of a designer cytokine (i.e., a stem cell factor 
capable of causing hematopoietic stem cells to divide and maintain their totipotential), 
a factor causing cancer cells to undergo spontaneous apoptosis, a factor that binds to 
5 the cell surface of target cells and labels them specifically, etc. 

Suitable secretory sequences are known, including signals from IL-2 
(MYRMQLLSCIALSLALVTNS; Villinger et al., J. Immunol. 155:3946 (1995)), growth 
hormone (MATGSRTSLLLAFGLLCLPWLQEGSAFPT; Roskam et al., Nucleic Acids 
Res. 7:30 (1979)); preproinsulin (MAL\A/MRLLPLLALLALWGPDPAAAEVbt; Belletal., 
10 Nature 284:26 (1980)); and influenza HA protein (M KAKLLVLLY AFVAGBQi, 

Sekiwawa et al., PNAS 80:3563)), with cleavage between the non-underlined- 
underlined junction. A particularly preferred secretory signal sequence is the signal 
leader sequence from the secreted cytokine IL-4, which comprises the first 24 amino 
acids of IL-4 as follows: MGLTSQLLPPLFFLLACAGNFVHG. 

15 In a preferred embodiment, the fusion partner is a rescue sequence. A rescue 

sequence is a sequence which may be used to purify or isolate either the candidate 
agent or the nucleic acid encoding it. Thus, for example, peptide rescue sequences 
include purification sequences such as the His 6 tag for use with Ni affinity columns and 
epitope tags for detection, immunoprecipitation or FACS (fluoroscence-activated cell 

2 0 sorting). Suitable epitope tags include myc (for use with the commercially available 

9E10 antibody), the BSP biotinylation target sequence of the bacterial enzyme BirA, flu 
tags, lacZ, and GST. 

Alternatively, the rescue sequence may be a unique oligonucleotide sequence which 
serves as a probe target site to allow the quick and easy isolation of the retroviral 
2 5 construct, via PCR, related techniques, or hybridization. 

In a preferred embodiment, the fusion partner is a stability sequence to confer stability 
to the candidate bioactive agent or the nucleic acid encoding it. Thus, for example, 
peptides may be stabilized by the incorporation of glycines after the initiation 
methionine (MG or MGGO), for protection of the peptide to ubiquitination as per 
30 Varshavskys N-End Rule, thus conferring long half-life in the cytoplasm. Similarly, two 

prolines at the C-terminus impart peptides that are largely resistant to 
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carboxypeptidase action. The presence of two glycines prior to the prolines impart 
both flexibility and prevent structure initiating events in the di-proline to be propagated 
into the candidate peptide structure. Thus, preferred stability sequences are as 
follows: MG(X) n GGPP, where X is any amino acid and n is an integer of at least four. 

5 In one embodiment, the fusion partner is a dimerization sequence. A dimerization 

sequence allows the non-covalent association of one random peptide to another 
random peptide, with sufficient affinity to remain associated under normal physiological 
conditions. This effectively allows small libraries of random peptides (for example, 10 4 ) 
to become large libraries if two peptides per cell are generated which then dimerize, to 
10 form an effective library of 10 8 (10 4 X 10 4 ). It also allows the formation of longer 

random peptides, if needed, or more structurally complex random peptide molecules. 
The dimers may be homo- or heterodimers. 

Dimerization sequences may be a single sequence that self-aggregates, or two 
sequences, each of which is generated in a different retroviral construct. That is, 
15 nucleic acids encoding both a first random peptide with dimerization sequence 1, and a 

second random peptide with dimerization sequence 2, such that upon introduction into 
a cell and expression of the nucleic acid, dimerization sequence 1 associates with 
dimerization sequence 2 to form a new random peptide structure. 

Suitable dimerization sequences will encompass a wide variety of sequences. Any 
2 0 number of protein-protein interaction sites are known. In addition, dimerization 

sequences may also be elucidated using standard methods such as the yeast two 
hybrid system, traditional biochemical affinity binding studies, or even using the 
present methods. 

The fusion partners may be placed anywhere (i.e. N-terminal, C-terminal, internal) in 

2 5 the structure as the biology and activity permits. 

In a preferred embodiment the fusion partner includes a linker or tethering sequence. 
Linker sequences between various targeting sequences (for example, membrane 
targeting sequences) and the other components of the constructs (such as the 
randomized candidate agents) may be desirable to allow the candidate agents to 

3 0 interact with potential targets unhindered. For example, when the candidate bioactive 
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agent is a peptide, useful linkers include glycine-serine polymers (including, for 
example, (GS) n , (GSGGS) n and (GGGS) n , where n is an integer of at least one), 
glycine-alanine polymers, alanine-serine polymers, and other flexible linkers such as 
the tether for the shaker potassium channel, and a large variety of other flexible 
5 linkers, as will be appreciated by those in the art. Glycine-serine polymers are 

preferred since both of these amino acids are relatively unstructured, and therefore 
may be able to serve as a neutral tether between components. Secondly, serine is 
hydrophilic and therefore able to solubiiize what could be a globular glycine chain. 
Third, similar chains have been shown to be effective in joining subunits of 
1 0 recombinant proteins such as single chain antibodies. 

In addition, the fusion partners, including presentation structures, may be modified, 
randomized, and/or matured to alter the presentation orientation of the randomized 
expression product. For example, determinants at the base of the loop may be 
modified to slightly modify the internal loop peptide tertiary structure, which maintaining 
15 the randomized amino acid sequence. 

In a preferred embodiment, combinations of fusion partners are used. Thus, for 
example, any number of combinations of presentation structures, targeting sequences, 
rescue sequences, and stability sequences may be used, with or without linker 
sequences. As is more fully described below, using a base vector that contains a 

20 cloning site for receiving random and/or biased libraries, one can cassette in various 

fusion partners 5' and 3' of the library. Table 1 outlines some of the possible 
combinations (without specifying the presentation structures) as follows. Using V as 
the variable cloning site for the random nucleic acid libraries, and representing each 
fusion partner by another letter, (i.e. N for nuclear localization sequence) each 

25 construct can be named as a string of representative letters reading 5' to 3' read as 

nucleic acid or N-terminal to C-terminal read as protein, such as NV or if cloned 
downstream of the variable region, VN. As implied here, the fusion partner sequences 
are cloned as cassettes into sites on either side of the variable region. C is for 
cytoplasmic (i.e. no localization sequence), E is a rescue sequence such as the myc 

3 0 epitope, G is a linker sequence (G10 is a glycine-serine chain of 10 amino acids, and 

G20 is a glycine-serine chain of 20 amino acids), M is a myristylation sequence, N is a 
nuclear localization sequence, ssTM is the signal sequence for a transmembrane 
anchoring sequence, TM is the transmembrane anchoring sequence, GPI is a GPI 
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membrane anchor sequence; S is a secretory signal sequence, etc. As will be 
appreciated by those in the art, any number of combinations can be made, in addition 
to those listed below. 



Table 1 



cytoplasmic 


C V 




CE V 




C VE 


secreted 


S V 




S E V 




S VE 


myristylated 


MV 




ME V 




M E G20 V 


transmembrane (intracellular) 


SSTMV 




ssTM V TM 




ssTM V E TM 




ssTM V G20 E TM 




ssTM V E 


transmembrane (GPI linked) 


ssTM V G E TM 


nuclear localization 


N E V 




N VE 



As will be appreciated by those in the art, these modules of sequences can be used in 
a large number of combinations and variations. In addition, as discussed herein, it is 
possible to have more than one variable region in a construct, either to together form a 
new surface or to bring two other molecules together. 



15 In a preferred embodiment, a candidate bioactive agent linked to a presentation 

structure is added at the variable region cloning site, V, above. Alternatively, no 
presentation structure is used, giving a "free" or "non-constrained" peptide or 
expression product. 

Preferred embodiments include the following: 
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a) intracellular, membrane-anchored, linked (i.e. tethered) free peptide: 

MRPI AGQgHTMASPLTRFLSLNLLLLGEStlLGSG PQRPEDCRPRGSVKGTGLDFAC 

DIYIWAPLAGICVALLLSLIITLICYHSR -GSGGSGSGGSGSGGSGSGGSGSGGSGGG- 
(X) n -GGPP, with the secretion signal from murine CD8 in bold, the transmembrane re- 
5 gion of CD8 in underline, and the linker, to provide flexibility (glycine) and solubility 

(serine) in italics. (X) n represents the random peptide, where n is an interger greater 
than about six. A preferred embodiment utilizing this structure utilizes biased peptides, 
as described below, for example using biased SH-3 domain-binding peptide libraries in 
the non-constrained peptide structures, since a number of surface receptor signaling 
10 systems employ SH-3 domains as part of the signaling apparatus. 

b) intracellular, membrane-achored, linked coiled coil: 

MRPI A^fiFMTMASPLTRFLSLrMLLLLGESllLGSG PQRPEDCRPRGSVKGTGLDFAC 

DIYIWAPLAGICVALLLSLIITLICYHSR GSGGSGSGGSGSGGSGSGGSGSGGSGGGC 
AALESEVSALESEVASLESEVAAL -(X)~- LAA VKSKLSA VKSKLA S VKSKLAA CG P P . with 
15 the coiled-coil structure shown in underlined italics. 

c) surface-tethered extracellular, non-constrained: 
MRPLAGGEHTMASPLTRFLSLNLLLLGESHLGSGGG-(X) n - 

GGSGGSGSGGSGSGGSGSGGSGSGGSGG GEQBEEBQBEBSSy}^^ 
IWAPLAGICVALLL SLI1TLICYHSRGGPP. 
20 d) surface-tethered, extracellular constrained: 

MRPI AftftPHTMAftPI TRFI SLNLU IGttllLG&GGGCAALESEVSALESEVASLESE 

VML <X) n - 

LAAVKSKLSAVKSKLASVKSKLAAC GGSGGSGSGGSGSGGSGSGGSGSGGSGGGR 
QRPEDCRPRGSVKGTGLDFACP IYIWAPLAGICVALLLSLIITLICYHSRGGPP. 

25 e) secreted, non-constrained: 

MRPLAGGEHTMASPLTRFLSLNLLLLGESIILGSGGG-(X) n -GGPP 

f) secreted, constrained: 

MRPLAGGEHTMASPLTRFLSLNLLLLGESIIL^^ 

AM -(X) n -j AAVK&KLSAVKSKLASVKSKLAACGPP. 

3 0 The candidate bioactive agents as described above are encoded by candidate nucleic 

acids. By "candidate nucleic acids" herein is meant a nucleic acid, generally RNA 
when retroviral delivery vehicles are used, which can be expressed to form candidate 
bioactive agents; that is, the candidate nucleic acids encode the candidate bioactive 
agents and the fusion partners, if present. In addition, the candidate nucleic acids will 
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also generally contain enough extra sequence to effect translation or transcription, as 
necessary. For a peptide library, the candidate nucleic acid generally contains cloning 
sites which are placed to allow in frame expression of the randomized peptides, and 
any fusion partners, if present, such as presentation structures. For example, when 
5 presentation structures are used, the presentation structure will generally contain the 

initating ATG, as a part of the parent vector. For a RNA library, the candidate nucleic 
acids are generally constructed with an internal CMV promoter, tRNA promoter or cell 
specific promoter designed for immediate and appropriate expression of the RNA 
structure at the initiation site of RNA synthesis. The RNA is expressed anti-sense to 
1 0 the direction of retroviral synthesis and is terminated as known, for example with an 

orientation specific terminator sequence. Interference from upstream transcription is 
alleviated in the target cell with the self-inactivation deletion, a common feature of 
certain retroviral expression systems. 

Generally, the candidate nucleic acids are expressed within the cells to produce 
15 expression products of the candidate nucleic acids. As outlined above, the expression 

products include translation products, i.e. peptides, or transcription products, i.e. 
nucleic acid. 



The candidate bioactive agents and candidate nucleic acids are randomized, either 
fully randomized or they are biased in their randomization, e.g. in nucleotide/residue 

2 0 frequency generally or per position. By "randomized" or grammatical equivalents 

herein is meant that each nucleic acid and peptide consists of essentially random 
nucleotides and amino acids, respectively. As is more fully described below, the 
candidate nucleic acids which give rise to the candidate expression products are 
chemically synthesized, and thus may incorporate any nucleotide at any position. 

2 5 Thus, when the candidate nucleic acids are expressed to form peptides, any amino 

acid residue may be incorporated at any position. The synthetic process can be 
designed to generate randomized nucleic acids, to allow the formation of all or most of 
the possible combinations over the length of the nucleic acid, thus forming a library of 
randomized candidate nucleic acids. 



30 



The library should provide a sufficiently structurally diverse population of randmized 
expression products to effect a probabilistically sufficient range of cellular responses to 
provide one or more cells exhibiting a desired response. Accordingly, an interaction 
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library must be large enough so that at least one of its members will have a structure 
that gives it affinity for some molecule, protein, or other factor whose activity is 
necessary for completion of the signaling pathway. Although it is difficult to gauge the 
required absolute size of an interaction library, nature provides a hint with the immune 
5 response: a diversity of 10 7 -10 8 different antibodies provides at least one combination 

with sufficient affinity to interact with most potential antigens faced by an organism. 
Published in vitro selection techniques have also shown that a library size of 10 7 to 10 8 
is sufficient to find structures with affinity for the target. A library of all combinations of 
a peptide 7 to 20 amino acids in length, such as proposed here for expression in retro- 

10 viruses, has the potential to code for 20 7 (10 9 ) to 20 20 . Thus, with libraries of 10 7 to 10 8 

per ml of retroviral particles the present methods allow a "working" subset of a 
theoretically complete interaction library for 7 amino acids, and a subset of shapes for 
the 20 20 library. Thus, in a preferred embodiment, at least 10 6 , preferably at least 10 7 , 
more preferably at least 10 8 and most preferably at least 10 9 different expression 

1 5 products are simultaneously analyzed in the subject methods. Preferred methods 

maximize library size and diversity. 

It is important to understand that in any library system encoded by oligonucleotide 
synthesis one cannot have complete control over the codons that will eventually be 
incorporated into the peptide structure. This is especially true in the case of codons 

2 0 encoding stop signals (TAA, TGA, TAG). In a synthesis with NNN as the random 

region, there is a 3/64, or 4.69%, chance that the codon will be a stop codon. Thus, in 
a peptide of 10 residues, there is an unacceptable high likelihood that 46.7% of the 
peptides will prematurely terminate. For free peptide structures this is perhaps not a 
problem. But for larger structures, such as those envisioned here, such termination 
25 will lead to sterile peptide expression. To alleviate this, random residues are encoded 

as NNK, where K= T or G. This allows for encoding of all potential amino acids 
(changing their relative representation slightly), but importantly preventing the encoding 
of two stop residues TAA and TGA. Thus, libraries encoding a 10 amino acid peptide 
will have a 15.6% chance to terminate prematurely. For candidate nucleic acids which 

3 0 are not designed to result in peptide expression products, this is not necessary. 



In one embodiment, the library is fully randomized, with no sequence preferences or 
constants at any position. In a preferred embodiment, the library is biased. That is, 
some positions within the sequence are either held constant, or are selected from a 
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limited number of possibilities. For example, in a preferred embodiment, the 
nucleotides or amino acid residues are randomized within a defined class, for example, 
of hydrophobic amino acids, hydrophilic residues, sterically biased (either small or 
large) residues, towards the creation of cysteines, for cross-linking, prolines for SH-3 
5 domains, serines, threonines, tyrosines or histidines for phosphorylation sites, etc., or 

to purines, etc. 

In a preferred embodiment, the bias is towards peptides or nucleic acids that interact 
with known classes of molecules. For example, when the candidate bioactive agent is 
a peptide, it is known that much of intracellular signaling is carried out via short regions 

10 of polypeptides interacting with other polypeptides through small peptide domains. For 

instance, a short region from the HIV-1 envelope cytoplasmic domain has been 
previously shown to block the action of cellular calmodulin. Regions of the Fas 
cytoplasmic domain, which shows homology to the mastoparan toxin from Wasps, can 
be limited to a short peptide region with death-inducing apoptotic or G protein inducing 

15 functions. Magainin, a natural peptide derived from Xenopus, can have potent anti- 

tumour and anti-microbial activity. Short peptide fragments of a protein kinase C 
isozyme (ftPKC), have been shown to block nuclear translocation of GPKC in Xenopus 
oocytes following stimulation. And, short SH-3 target peptides have been used as 
psuedosubstrates for specific binding to SH-3 proteins. This is of course a short list of 

2 0 available peptides with biological activity, as the literature is dense in this area. Thus, 

there is much precedent for the potential of small peptides to have activity on 
intracellular signaling cascades. In addition, agonists and antagonists of any number 
of molecules may be used as the basis of biased randomization of candidate bioactive 
agents as well. 

25 Thus, a number of molecules or protein domains are suitable as starting points for the 

generation of biased randomized candidate bioactive agents. A large number of small 
molecule domains are known, that confer a common function, structure or affinity. In 
addition, as is appreciated in the art, areas of weak amino acid homology may have 
strong structural homology. A number of these molecules, domains, and/or 

3 0 corresponding consensus sequences, are known, including, but are not limited to, SH- 

2 domains, SH-3 domains, Pleckstrin, death domains, protease cleavage/recognition 
sites, enzyme inhibitors, enzyme substrates, Traf, etc. Similarly, there are a number of 
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known nucleic acid binding proteins containing domains suitable for use in the 
invention. For example, leucine zipper consensus sequences are known. 

Where the ultimate expression product is a nucleic acid, at least 10, preferably at least 
12, more preferably at least 15, most preferably at least 21 nucleotide positions need 
5 to be randomized, with more preferable if the randomization is less than perfect. 

Similarly, at least 5, preferably at least 6, more preferably at least 7 amino acid 
positions need to be randomized; again, more are preferable if the randomization is 
less than perfect 

In a preferred embodiment, biased SH-3 domain-binding oligonucleotides/peptides are 
10 made. SH-3 domains have been shown to recognize short target motifs (SH-3 

domain-binding peptides), about ten to twelve residues in a linear sequence, that can 
be encoded as short peptides with high affinity for the target SH-3 domain. Consensus 
sequences for SH-3 domain binding proteins have been proposed. Thus, in a 
preferred embodiment, oligos/peptides are made with the following biases 
15 1 . XXXPPXPXX, wherein X is a randomized residue. 

2. (within the positions of residue positions 11 to -2): 







11 10 


9 


8 


7 


6 5 


4 


3 


2 


1 




Met 


GlyaallaalO 


aa9 


aa8 


aa7 


Arg Pro 


Leu 


Pro 


Pro 


hyd 




0 


-1 -2 


















20 


Pro 


hyd hyd Gly 


Gly 


Pro 


Pro 


STOP 












atg 


ggc nnk nnk 


nnk 


nnk 


nnk 


aga cct 


ctg 


cct 


cca 


sbk 




ggg 


sbk sbk gga 


ggc 


cca 


cct 


TAA1. 











In this embodiment, the N-terminus flanking region is suggested to have the greatest 
effects on binding affinity and is therefore entirely randomized. "Hyd" indicates a bias 
25 toward a hydrophobic residue, i.e.- Val, Ala, Gly, Leu, Pro, Arg. To encode a 

hydrophobically biased residue, "sbk" codon biased structure is used. Examination of 
the codons within the genetic code will ensure this encodes generally hydrophobic 
residues. s= g,c; b= t, g, c; v= a, g, c; m= a, c; k= t g; n= a, t, g, c. 
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The candidate nucleic acids are introduced into the cells to screen for transdominant 
bioactive agents capable of altering the phenotype of a cell By "introduced into " or 
grammatical equivalents herein is meant that the nucleic acids enter the cells in a 
manner suitable for subsequent expression of the nucleic acid. The method of 
5 introduction is largely dictated by the targeted cell type, discussed below. Exemplary 

methods include CaP0 4 precipitation, liposome fusion, lipofectin®, electroporation, 
viral infection, etc. The candidate nucleic acids may stably integrate into the genome 
of the host cell (for example, with retroviral introduction, outlined below), or may exist 
either transiently or stably in the cytoplasm (i.e. through the use of traditional plasmids, 
10 utilizing standard regulatory sequences, selection markers, etc.). As many 

pharmaceutical^ important screens require human or model mammalian cell targets, 
retroviral vectors capable of transfecting such targets are preferred. 

in a preferred embodiment, the candidate nucleic acids are part of a retroviral particle 
which infects the cells. Generally, infection of the cells is straightforward with the 
15 application of the infection-enhancing reagent polybrene, which is a polycation that 

facilitates viral binding to the target cell. Infection can be optimized such that each cell 
generally expresses a single construct, using the ratio of virus particles to number of 
cells. Infection follows a Poisson distribution. 

In a preferred embodiment, the candidate nucleic acids are introduced into the cells 

2 0 using retroviral vectors. Currently, the most efficient gene transfer methodologies 

harness the capacity of engineered viruses, such as retroviruses, to bypass natural 
cellular barriers to exogenous nucleic acid uptake. The use of recombinant retroviruses 
was pioneered by Richard Mulligan and David Baltimore with the Psi-2 lines and 
analogous retrovirus packaging systems, based on NIH 3T3 cells (see Mann et al. t 
25 Cell 33:153-159 (1993), hereby incorporated by reference). Such helper-defective 

packaging lines are capable of producing all the necessary trans proteins -gag, pol, 
and env- that are required for packaging, processing, reverse transcription, and 
integration of recombinant genomes. Those RNA molecules that have in cis the y 
packaging signal are packaged into maturing virions. Retroviruses are preferred for a 

3 0 number of reasons. First, their derivation is easy. Second, unlike Adenovirus-medi- 

ated gene delivery, expression from retroviruses is long-term (adenoviruses do not 
integrate). Adeno-associated viruses have limited space for genes and regulatory 
units and there is some controversy as to their ability to integrate. Retroviruses 
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therefore offer the best current compromise in terms of long-term expression, genomic 
flexibility, and stable integration, among other features. The main advantage of 
retroviruses is that their integration into the host genome allows for their stable 
transmission through cell division. This ensures that in cell types which undergo 
5 multiple independent maturation steps, such as hematopoietic cell progression, the 

retrovirus construct will remain resident and continue to express. 



A particularly well suited retroviral transfection system is described in Mann et al., 
supra: Pear et al., PNAS USA 90(18):8392-6 (1993); Kitamura et al., PNAS USA 
92:9146-9150 (1995); Kinsella et al., Human Gene Therapy 7:1405-1413; Hofmann et 
10 al., PNAS USA 93:5185-5190; Choate et al., Human Gene Therapy 7:2247 (1996); and 

WO 94/19478; and references cited therein, all of which are incorporated by reference. 

In one embodiment of the invention, the library is generated in a retrovirus DNA 
construct backbone, as is generally described in the examples. Standard 
oligonucleotide synthesis is done to generate the random portion of the candidate 

15 bioactive agent, using techniques well known in the art (see Eckstein, Oligonucleotides 

and Analogues, A Practical Approach, IRL Press at Oxford University Press, 1991); 
libraries may be commercially purchased. Libraries with up to 10 9 unique sequences 
can be readily generated in such DNA backbones. After generation of the DNA library, 
the library is cloned into a first primer. The first primer serves as a "cassette", which is 

2 0 inserted into the retroviral construct. The first primer generally contains a number of 

elements, including for example, the required regulatory sequences (e.g. translation, 
transcription, promoters, etc), fusion partners, restriction endonuclease (cloning and 
subcloning) sites, stop codons (preferably in all three frames), regions of 
complementarity for second strand priming (preferably at the end of the stop codon 

2 5 region as minor deletions or insertions may occur in the random region), etc. 



A second primer is then added, which generally consists of some or all of the 
complementarity region to prime the first primer and optional necessary sequences for 
a second unique restriction site for subcloning. DNA polymerase is added to make 
double-stranded oligonucleotides. The double-stranded oligonucleotides are cleaved 
3 0 with the appropriate subcloning restriction endonucleases and subcloned into the 

target retroviral vectors, described below. 
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Any number of suitable retroviral vectors may be used. Generally, the retroviral 
vectors may include: selectable marker genes under the control of internal ribosome 
entry sites (IRES), which allows for bicistronic operons and thus greatly facilitates the 
selection of cells expressing peptides at uniformly high levels; and promoters driving 
5 expression of a second gene, placed in sense or anti-sense relative to the 5* LTR. 

Suitable selection genes include, but are not limited to, neomycin, blastocidin, 
bleomycin, puromycin, and hygromycin resistance genes, as well as self-fluorescent 
markers such as green fluoroscent protein, enzymatic markers such as lacZ, and 
surface proteins such as CD8, etc. 

10 Preferred vectors include a vector based on the murine stem cell virus (MSCV) (see 

Hawley et al., Gene Therapy 1:136 (1994)) and a modified MFG virus (Rivere et al., 
Genetics 92:6733 (1995)), and pBABE, outlined in the examples. A general schematic 
of the retroviral construct is depicted in Figure 4. 

The retroviruses may include inducible and constitutive promoters. For example, there 
are situations wherein it is necessary to induce peptide expression only during certain 
phases of the selection process. For instance, a scheme to provide pro-inflammatory 
cytokines in certain instances must include induced expression of the peptides. This is 
because there is some expectation that over-expressed pro-inflammatory drugs might 
in the long-term be detrimental to cell growth. Accordingly, constitutive expression is 
undesirable, and the peptide is only turned on during that phase of the selection 
process when the phenotype is required, and then shut the peptide down by turning off 
the retroviral expression to confirm the effect or ensure long-term survival of the 
producer cells. A large number of both inducible and constitutive promoters are 
known. 

25 In addition, it is possible to configure a retroviral vector to allow inducible expression of 

retroviral inserts after integration of a single vector in target cells; importantly, the 
entire system is contained within the single retrovirus. Tet-inducible retroviruses have 
been diesgned incorporating the Self-Inactivating (SIN) feature of 3' LTR 
enhancer/promoter retroviral deletion mutant (Hoffman et al., PNAS USA 93:5185 

3 0 (1996)). Expression of this vector in cells is virtually undetectable in the presence of 

tetracycline or other active analogs. However, in the absence of Tet, expression is 
turned on to maximum within 48 hours after induction, with uniform increased 
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expression of the whole population of cells that harbor the inducible retrovirus, 
indicating that expression is regulated uniformly within the infected cell population. A 
similar, related system uses a mutated Tet DNA-binding domain such that it bound 
DNA in the presence of Tet, and was removed in the absence of Tet. Either of these 
5 systems is suitable. 

In this manner the primers create a library of fragments, each containing a different 
random nucleotide sequence that may encode a different peptide. The ligation 
products are then transformed into bacteria, such as E. coli, and DNA is prepared from 
the resulting library, as is generally outlined in Kitamura, PNAS USA 92:9146-9150 
10 (1995), hereby expressly incorporated by reference. 

Delivery of the library DNA into a retroviral packaging system results in conversion to 
infectious virus. Suitable retroviral packaging system cell lines include, but are not 
limited to, the Bing and BOSC23 cell lines described in WO 94/19478; Soneoka et al., 
Nucleic Acid Res. 23(4):628 (1995); Finer et al., Blood 83:43 (1994); Pheonix 
15 packaging lines such as PhiNX-eco and PhiNX-ampho, described below; 292T + gag- 

pol and retrovirus envelope; PA317; and cell lines outlined in Markowitz et al., Virology 
167:400 (1988), Markowitz et al., J. Virol. 62:1120 (1988), Li et al., PNAS USA 
93:1 1658 (1996), Kinsella et al., Human Gene Therapy 7:1405 (1996), ail of which are 
incorporated by reference. 

2 0 Preferred systems include PhiNX-eco and PhiNX-ampho or similar ceil lines, which are 

two cells lines as follows. The cell lines are based on the BING and BOSC23 cell lines 
described in WO 94/19478, which are based on the 293T cell line (a human embryonic 
kidney line transformed with adenovirus E1a and carrying a temperature sensitive T 
antigen co-selected with neomycin). The unique feature of this cell line is that it is 

2 5 highly transfectable with either calcium phosphate mediated transfection or lipid-based 

transfection protocols - greater than 50% of 293T cells can be transiently transfected 
with plasmid DNA. Thus, the cell line could be a cellular milieu in which retroviral 
structural proteins and genomic viral RNA could brought together rapidly for creation of 
helper-defective virus. 293T cells were therefore engineered with stably integrated 

3 0 defective constructs capable of producing gag-pol, and envelope protein for either 

ecotropic or amphotropic viruses. These lines were called BOSC23 and Bing, 
respectively. The utility of these lines was that one could produce small amounts of 
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recombinant virus transiently for use in small-scale experimentation. The lines offered 
advantages over previous stable systems in that virus could be produced in days 
rather than months. 

Two problems became apparent with these first generation lines over the two years 
5 they have been in wide use. First, gag-pol and envelope expression was unstable and 

the lines required vigilant checking for retroviral production capacity; second the 
structure of the vectors used for protein production were not considered fully "safe" for 
helper virus production; and third, one of the lines was shown to be inadvertently carry- 
ing a hygromycin-containing retrovirus. Although the BING and BOSC23 lines are 

10 useful in the present invention, all of these potentially problematic issues are 

addressed in the PhiNX second-generation lines. These lines are based on 293T cells 
as well, with the following improvements. First, the ability to monitor gag-pol 
production on a cell-by cell basis was made by introducing an IRES-CD8 surface 
marker expression cassette downstream of the reading frame of the gag-pol construct 

15 (other surface markers besides CD8 are also useful). IRES (internal ribosome entry 

site) sequences allow secondary or tertiary protein translation from a single mRNA 
transcript. Thus, CD8 expression is a direct reflection of intracellular gag-pol and the 
stability of the producer cell population's ability to produce gag-pol can be readily 
monitored by flow cytometry. Second, for both the gag-pol and envelope constructs 

2 0 non-Moloney promoters were used to minimize recombination potential with introduced 

retroviral constructs, and different promoters for gag-pol and envelope were used to 
minimize their inter-recombination potential. The promoters used were CMV and RSV. 
Two cell lines were created, PHEONIX-ECO and PHEONIX-AMPHO. Gag-pol was 
introduced with hygromycin as the co-selectable marker and the envelope proteins 
25 were introduced with diptheria resistance as the co-selectable marker. Finally, the 

cells were screened to find a relatively rare cell type that produced gag-pol and env in 
a uniform distribution, although this is not required. In addition, a line termed 
PHEONIX-gp has been produced that expresses only gag-pol. This line is available 
for further pseudotyping of retroviral virions with other envelope proteins such as 

3 0 gibbon ape leukemia virus envelope or Vesicular Stomatitus VSV-G protein, 

Xenotropic, or retargeting envelopes can also be added. 

Both PHEONIX-ECO and PHEONIX-AMPHO were tested for helper virus production 
and established as being helper-virus free. Both lines can carry episomes for the 
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creation of stable cell lines which can be used to produce retrovirus. Both lines are 
readily testable by flow cytometry for stability of gag-pol (CD8) and envelope ex- 
pression; after several months of testing the lines appear stable, and do not 
demonstrate loss of titre as did the first-generation lines BOSC23 and Bing (partly due 
5 to the choice of promoters driving expression of gag-pol and envelope). Both lines can 

also be used to transiently produce virus in a few days. Thus, these new lines are fully 
compatible with transient, episomal stable, and library generation for retroviral gene 
transfer experiments. Finally, the titres produced by these lines have been tested. 
Using standard polybrene-enhanced retroviral infection, titres approaching or above 
10 1 0 7 per ml were observed for both PHEONIX-eco and PHEONIX-ampho when carrying 

episomal constructs. When transiently produced virus is made, titres are usually 1/2 to 
1/3 that value. 

These lines are helper-virus free, carry episomes for long-term stable production of 
retrovirus, stably produce gag-pol and env, and do not demonstrate loss of viral titre 
15 over time. In additon, PhiNX-eco and PhiNX-ampho are capable of producing titres 

approaching or above 10 7 per ml when carrying episomal constructs, which, with 
concentration of virus, can be enhanced to 10 8 to 10 9 per ml. 

In a preferred embodiment, the cell lines disclosed above, and the other methods for 
producing retrovirus, are useful for production of virus by transient transfection. The 

2 0 virus can either be used directly or be used to infect another retroviral producer cell 

line for "expansion" of the library. 

Concentration of virus may be done as follows. Generally, retroviruses are titred by 
applying retrovirus-containing supernatant onto indicator cells, such as NIH3T3 cells, 
and then measuring the percentage of cells expressing phenotypic consequences of 
25 infection. The concentration of the virus is determined by muttipying the percentage of 

cells infected by the dilution factor involved, and taking into account the number of 
target cells available to obtain a relative titre. If the retrovirus contains a reporter gene, 
such as lacZ, then infection, integration, and expression of the recombinant virus is 
measured by histological staining for lacZ exprssion or by flow cytometry (FACS). In 

3 0 general, retroviral titres generated from even the best of the producer cells do not 

exceed 10 7 per ml, unless concentration by relatively expensive or exotic apparatus. 
However, as it has been recently postulated that since a particle as large as a 
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retrovirus will not move very far by brownian motion in liquid, fluid dynamics predicts 
that much of the virus never comes in contact with the cells to initiate the infection 
process. However, if cells are grown or placed on a porous filter and retrovirus is 
allowed to move past cells by gradual gravitometric flow, a high concentration of virus 
5 around cells can be effectively maintained at all times. Thus, up to a ten-fold higher 

infectivity by infecting cells on a porous membrane and allowing retrovirus supernatant 
to flow past them has been seen. This should allow titres of 1 0 9 after concentration. 



The candidate nucleic acids, as part of the retroviral construct, are introduced into the 
cells to screen for transdominant bioactive agents capable of altering the phenotype of 
10 a cell. 



As will be appreciated by those in the art, the type of cells used in the present invention 
can vary widely. Basically, any mammalian cells may be used, with mouse, rat, 
primate and human cells being particularly preferred, although as will be appreciated 
by those in the art, modifications of the system by pseudotyping allows ail eukaryotic 

15 cells to be used, preferably higher eukaryotes. As is more fully described below, a 

screen will be set up such that the cells exhibit a selectable phenotype in the presence 
of a bioactive agent. As is more fully described below, cell types implicated in a wide 
variety of disease conditions are particularly useful, so long as a suitable screen may 
be designed to allow the selection of cells that exhibit an altered phenotype as a 

2 0 consequence of the presence of a transdominant bioactive agent within the cell. 



Accordingly, suitable cell types include, but are not limited to, tumor cells of all types 
(particularly melanoma, myeloid leukemia, carcinomas of the lung, breast, ovaries, 
colon, kidney, prostate, pancreas and testes), cardiomyocytes, endothelial cells, 
epithelial cells, lymphocytes (T-cell and B cell) , mast cells, eosinophils, vascular 

2 5 intimal cells, hepatocytes, leukocytes including mononuclear leukocytes, stem cells 

such as haemopoetic, neural, skin, lung, kidney, liver and myocyte stem cells (for use 
in screening for differentiation and de-differentiation factors), osteoclasts, chondrocytes 
and other connective tissue cells, keratinocytes, melanocytes, liver cells, kidney cells, 
and adipocytes. Suitable cells also include known research cells, including, but not 

3 0 limited to, Jurkat T cells, NIH3T3 cells, CHO, Cos, etc. See the ATCC cell line catalog, 

hereby expressly incorporated by reference. 
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In one embodiment, the cells may be genetically engineered, that is, contain 
exogeneous nucleic acid, for example, to contain target molecules. 

In a preferred embodiment, a first plurality of cells is screened. That is, the cells into 
which the candidate nucleic acids are introduced are screened for an altered 
5 phenotype. Thus, in this embodiment, the effect of the transdominant bioactive agent 

is seen in the same cells in which it is made; i.e. an autocrine effect. 

By a "plurality of cells" herein is meant roughly from about 10 3 cells to 10 8 or 10 9 , with 
from 10 6 to 10 8 being preferred. This plurality of cells comprises a cellular library, 
wherein generally each cell within the library contains a member of the retroviral 

10 molecular library, i.e. a different candidate nucleic acid, although as will be appreciated 

by those in the art, some cells within the library may not contain a retrovirus, and some 
may contain more than one. When methods other than retroviral infection are used to 
introduce the candidate nucleic acids into a plurality of cells, the distribution of 
candidate nucleic acids within the individual cell members of the cellular library may 

15 vary widely, as it is generally difficult to control the number of nucleic acids which enter 

a cell during electroporation, etc. 

In a preferred embodiment, the candidate nucleic acids are introduced into a first 
plurality of cells, and the effect of the candidate bioactive agents is screened in a 
second or third plurality of cells, different from the first plurality of cells, i.e. generally a 

2 0 different cell type. That is, the effect of the transdominant bioactive agents is due to an 

extracellular effect on a second cell; i.e. an endocrine or paracrine effect. This is done 
using standard techniques. The first plurality of cells may be grown in or on one 
media, and the media is allowed to touch a second plurality of cells, and the effect 
measured. Alternatively, there may be direct contact between the cells. Thus, 

25 "contacting" is functional contact, and includes both direct and indirect, tn this 

embodiment, the first plurality of cells may or may not be screened. 

If necessary, the cells are treated to conditions suitable for the expression of the 
candidate nucleic acids (for example, when inducible promoters are used), to produce 
the candidate expression products, either translation or transcription products. 
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Thus, the methods of the present invention comprise introducing a molecular library of 
randomized candidate nucleic acids into a plurality of cells, a cellular library. Each of 
the nucleic acids comprises a different, generally randomized, nucleotide sequence. 
The plurality of cells is then screened, as is more fully outlined below, for a cell 
5 exhibiting an altered phenotype. The altered phenotype is due to the presence of a 

transdominant bioactive agent 

By "altered phenotype" or "changed physiology" or other grammatical equivalents 
herein is meant that the phenotype of the cell is altered in some way, preferably in 
some detectable and/or measurable way. As will be appreciated in the art, a strength 

10 of the present invention is the wide variety of cell types and potential phenotypic 

changes which may be tested using the present methods. Accordingly, any 
phenotypic change which may be observed, detected, or measured may be the basis 
of the screening methods herein. Suitable phenotypic changes include, but are not 
limited to: gross physical changes such as changes in cell morphology, cell growth, cell 

15 viability, adhesion to substrates or other cells, and cellular density; changes in the 

expression of one or more RNAs, proteins, lipids, hormones, cytokines, or other 
molecules; changes in the equilibrium state (i.e. half-life) or one or more RNAs, 
proteins, lipids, hormones, cytokines, or other molecules; changes in the localization of 
one or more RNAs, proteins, lipids, hormones, cytokines, or other molecules; changes 

2 0 in the bioactivity or specific activity of one or more RNAs, proteins, lipids, hormones, 

cytokines, receptors, or other molecules; changes in the secretion of ions, cytokines, 
hormones, growth factors, or other molecules; alterations in cellular membrane 
potentials, polarization, integrity or transport; changes in infectivity, susceptability, 
latency, adhesion, and uptake of viruses and bacterial pathogens; etc. By "capable of 

2 5 altering the phenotype" herein is meant that the bioactive agent can change the 

phenotype of the cell in some detectable and/or measurable way. 

The altered phenotype may be detected in a wide variety of ways, as is described 
more fully below, and will generally depend and correspond to the phenotype that is 
being changed. Generally, the changed phenotype is detected using, for example: 

3 0 microscopic analysis of cell morphotogy; standard cell viability assays, including both 

increased cell death and increased cell viability, for example, cells that are now 
resistant to cell death via virus, bacteria, or bacterial or synthetic toxins; standard 
labeling assays such as fluorometric indicator assays for the presence or level of a 
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particular cell or molecule, including FACS or other dye staining techniques; 
biochemical detection of the expression of target compounds after killing the cells; etc. 
In some cases, as is more fully described herein, the altered phenotype is detected in 
the cell in which the randomized nucleic acid was introduced; in other embodiments, 
5 the altered phenotype is detected in a second cell which is responding to some 

molecular signal from the first cell. 

An altered phenotype of a cell indicates the presence of a transdominant bioactive 
agent. By "transdominant" herein is meant that the bioactive agent indirectly causes 
the altered phenotype by acting on a second molecule, which leads to an altered 

1 0 phenotype. That is, a transdominant expression product has an effect that is not in cis, 

i.e., a trans event as defined in genetic terms or biochemical terms. A transdominant 
effect is a distinguishable effect by a molecular entity (i.e., the encoded peptide or 
RNA) upon some separate and distinguishable target; that is, not an effect upon the 
encoded entity itself. As such, transdominant effects include many well-known effects 

15 by pharmacologic agents upon target molecules or pathways in cells or physiologic 

systems; for instance, the p-lactam antibiotics have a transdominant effect upon 
peptidoglycan synthesis in bacterial cells by binding to penicillin binding proteins and 
disrupting their functions. An exemplary transdominant effect by a peptide is the ability 
to inhibit NF-kB signaling by binding to IxB-a at a region critical for its function, such 

2 0 that in the presence of sufficient amounts of the peptide (or molecular entity), the 

signaling pathways that normally lead to the activation of NF-kB through 
phosphorylation and/or degradation of IkB-oc are inhibited from acting at IkB-cx 
because of the binding of the peptide or molecular entity. In another instance, 
signaling pathways that are normally activated to secrete IgE are inhibited in the 
25 presence of peptide. Or, signaling pathways in adipose tissue cells, normally 

quiescent, are activated to metabolize fat. Or, in the presence of a peptide, 
intracellular mechanisms for the replication of certain viruses, such as HIV-I, or Herpes 
viridae family members, or Respiratory Syncytia Virus, for example, are inhibited. 

A transdominant effect upon a protein or molecular pathway is clearly distinguishable 

3 0 from randomization, change, or mutation of a sequence within a protein or molecule of 

known or unknown function to enhance or diminish a biochemical ability that protein or 
molecule already manifests. For instance, a protein that enzymatically cleaves 
P-lactam antibiotics, a p-lactamase, could be enhanced or diminished in its activity by 
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mutating sequences internal to its structure that enhance or diminish the ability of this 
enzyme to act upon and cleave P-lactam antibiotics. This would be called a cis 
mutation to the protein. The effect of this protein upon p-lactam antibiotics is an 
activity the protein already manifests, to a distinguishable degree. Similarly, a mutation 
5 in the leader sequence that enhanced the export of this protein to the extracellular 

spaces wherein it might encounter p-lactam molecules more readily, or a mutation 
within the sequence that enhance the stability of the protein, would be termed cis 
mutations in the protein. For comparison, a transdominant effector of this protein 
would include an agent, independent of the P-lactamase, that bound to the p- 
1 0 lactamase in such a way that it enhanced or diminished the function of the p-lactamase 

by virtue of its binding to P-lactamase. 

In general, cis-effects are effects within molecules wherein elements that are 
interacting are covalently joined to each other although these elements might 
individually manifest themselves as separable domains. Trans-effects (transdominant 
15 in that under some cellular conditions the desired effect is manifested) are those 

effects between distinct molecular entities, such that molecular entity A, not covalently 
linked to molecular entity B, binds to or otherwise has an effect upon the activities of 
entity B. As such, most known pharmacological agents are transdominant effectors. 

In a preferred embodiment, once a cell with an altered phenotype is detected, the cell 
20 is isolated from the plurality which do not have altered phenotypes. This may be done 

in any number of ways, as is known in the art, and will in some instances depend on 
the assay or screen. Suitable isolation techniques include, but are not limited to, FACS, 
lysis selection using complement, cell cloning, scanning by Fluorimager, expression of 
a "survival" protein, induced expression of a cell surface protein or other molecule that 
25 can be rendered fluorescent or taggable for physical isolation; expression of an 

enzyme that changes a non-fluorescent molecule to a fluorescent one; overgrowth 
against a background of no or slow growth; death of cells and isolation of DNA or other 
cell vitality indicator dyes, etc. 

In a preferred embodiment, the candidate nucleic acid and/or the bioactive agent is 
3 0 isolated from the positive cell. This may be done in a number of ways. In a preferred 

embodiment, primers complementary to DNA regions common to the retroviral 
constructs, or to specific components of the library such as a rescue sequence, 
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defined above, are used to "rescue" the unique random sequence. Alternatively, the 
bioactive agent is isolated using a rescue sequence. Thus, for example, rescue 
sequences comprising epitope tags or purification sequences may be used to pull out 
the bioactive agent, using immunoprecipitation or affinity columns. In some instances, 
as is outlined below, this may also pull out the primary target molecule, if there is a 
sufficiently strong binding interaction between the bioactive agent and the target 
molecule. Alternatively, the peptide may be detected using mass spectroscopy. 

Once rescued, the sequence of the bioactive agent and/or bioactive nucleic acid is 
determined. This information can then be used in a number of ways. 

In a preferred embodiment, the bioactive agent is resynthesized and reintroduced into 
the target cells, to verify the effect. This may be done using retroviruses, or 
alternatively using fusions to the HIV-1 Tat protein, and analogs and related proteins, 
which allows very high uptake into target cells. See for example, Fawell et aL, PNAS 
USA 91:664 (1994); Frankel et al., Cell 55:1 189 (1988); Savion et al., J. Biol. Chem. 
256:1149 (1981); Derossi etal., J. Biol. Chem. 269:10444 (1994); and Baldin et al, 
EMBO J. 9:1511 (1990), all of which are incorporated by reference. 

In a preferred embodiment, the sequence of a bioactive agent is used to generate 
more candidate bioactive agents. For example, the sequence of the bioactive agent 
may be the basis of a second round of (biased) randomization, to develop bioactive 
agents with increased or altered activities. Alternatively, the second round of 
randomization may change the affinity of the bioactive agent. Furthermore, it may be 
desirable to put the identified random region of the bioactive agent into other 
presentation structures, or to alter the sequence of the constant region of the 
presentation structure, to alter the conformation/shape of the bioactive agent. It may 
also be desirable to "walk" around a potential binding site, in a manner similar to the 
mutagenesis of a binding pocket, by keeping one end of the ligand region constant and 
randomizing the other end to shift the binding of the peptide around. 

In a preferred embodiment, either the bioactive agent or the bioactive nucleic acid 
encoding it is used to identify target molecules, i.e. the molecules with which the 
bioactive agent interacts. As will be appreciated by those in the art, there may be 
primary target molecules, to which the bioactive agent binds or acts upon directly, and 



there may be secondary target molecules, which are part of the signalling pathway 
affected by the bioactive agent; these might be termed "validated targets". 

In a preferred embodiment, the bioactive agent is used to pull out target molecules. 
For example, as outlined herein, if the target molecules are proteins, the use of epitope 
tags or purification sequences can allow the purification of primary target molecules via 
biochemical means (co-immunoprecipitation t affinity columns, etc.). Alternatively, the 
peptide, when expressed in bacteria and purified, can be used as a probe against a 
bacterial cDNA expression library made from mRNA of the target cell type. Or, 
peptides can be used as "bait" in either yeast or mammalian two or three hybrid 
systems. Such interaction cloning approaches have been very useful to isolate DNA- 
binding proteins and other interacting protein components. The peptide(s) can be 
combined with other pharmacologic activators to study the epistatic relationships of 
signal transduction pathways in question. It is also possible to synthetically prepare 
labeled peptide bioactive agent and use it to screen a cDNA library expressed in 
bacteriophage for those cDNAs which bind the peptide. Furthermore, it is also 
possible that one could use cDNA cloning via retroviral libraries to "complement" the 
effect induced by the peptide. In such a strategy, the peptide would be required to be 
stochiometrically titrating away some important factor for a specific signaling pathway. 
If this molecule or activity is replenished by over-expression of a cDNA from within a 
cDNA library, then one can clone the target. Similarly, cDNAs cloned by any of the 
above yeast or bacteriophage systems can be reintroduced to mammalian cells in this 
manner to confirm that they act to complement function in the system the peptide acts 
upon. 

Once primary target molecules have been identified, secondary target molecules may 
be identified in the same manner, using the primary target as the "bait". In this 
manner, signalling pathways may be elucidated. Similarly, bioactive agents specific for 
secondary target molecules may also be discovered, to allow a number of bioactive 
agents to act on a single pathway, for example for combination therapies. 

The screening methods of the present invention may be useful to screen a large 
number of cell types under a wide variety of conditions. Generally, the host cells are 
cells that are involved in disease states, and they are tested or screened under 
conditions that normally result in undesirable consequences on the cells. When a 
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suitable bioactive agent is found, the undesirable effect may be reduced or eliminated. 
Alternatively, normally desirable consequences may be reduced or eliminated, with an 
eye towards elucidating the cellular mechanisms associated with the disease state or 
signalling pathway. 

5 In a preferred embodiment the present methods are useful in cancer applications. 

The ability to rapidly and specifically kill tumor cells is a cornerstone of cancer 
chemotherapy. In general, using the methods of the present invention, random 
libraries can be introduced into any tumor cell (primary or cultured), and peptides 
identified which by themselves induce apoptosis, cell death, loss of cell division or 

10 decreased cell growth. This may be done de novo, or by biased randomization toward 

known peptide agents, such as angiostatin, which inhibits blood vessel wall growth. 
Alternatively, the methods of the present invention can be combined with other cancer 
therapeutics (e.g. drugs or radiation) to sensitize the cells and thus induce rapid and 
specific apoptosis, cell death, loss of cell division or decreased cell growth after 

15 exposure to a secondary agent. Similarly, the present methods may be used in 

conjunction with known cancer therapeutics to screen for agonists to make the 
therapeutic more effective or less toxic. This is particularly preferred when the 
chemotherapeutic is very expensive to produce such as taxol. 

Known oncogenes such as v-Abl, v-Src, v-Ras, and others, induce a transformed 
20 phenotype leading to abnormal cell growth when transfected Into certain cells. This is 

also a major problem with micro-metastases. Thus, in a preferred embodiment, non- 
transformed cells can be transfected with these oncogenes, and then random libraries 
introduced into these cells, to select for bioactive agents which reverse or correct the 
transformed state. One of the signal features of oncogene transformation of cells is 

2 5 the loss of contact inhibition and the ability to grow in soft-agar. When transforming 

viruses are constructed containing v-Abl, v-Src, or v-Ras in IRES-puro retroviral 
vectors, infected into target 3T3 cells, and subjected to puromycin selection, all of the 
3T3 cells hyper-transform and detach from the plate. The cells may be removed by 
washing with fresh medium. This can serve as the basis of a screen, since cells which 

3 0 express a bioactive agent will remain attached to the plate and form colonies. 

Similarly, the growth and/or spread of certain tumor types is enhanced by stimulatory 
responses from growth factors and cytokines (PDGF, EGF, Heregulin, and others) 
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which bind to receptors on the surfaces of specific tumors. In a preferred embodiment, 
the methods of the invention are used to inhibit or stop tumor growth and/or spread, by 
finding bioactive agents capable of blocking the ability of the growth factor or cytokine 
to stimulate the tumor cell. The introduction of random libraries into specific tumor 
5 cells with the addition of the growth factor or cytokine, followed by selection of 

bioactive agents which block the binding, signaling, phenotypic and/or functional 
responses of these tumor cells to the growth factor or cytokine in question. 

Similarly, the spread of cancer cells (invasion and metastasis) is a significant problem 
limiting the success of cancer therapies. The ability to inhibit the invasion and/or 
migration of specific tumor cells would be a significant advance in the therapy of 
cancer. Tumor cells known to have a high metastatic potential (for example, 
melanoma, lung cell carcinoma, breast and ovarian carcinoma) can have random 
libraries introduced into them, and peptides selected which in a migration or invasion 
assay, inhibit the migration and/or invasion of specific tumor cells. Particular 
applications for inhibition of the metastatic phenotype, which could allow a more 
specific inhibition of metastasis, include the metastasis suppressor gene NM23, which 
codes for a dinucleoside diphosphate kinase. Thus intracellular peptide activators cf 
this gene could block metastasis, and a screen for its upregulation (by fusing it to a 
reporter gene) would be of interest. Many oncogenes also enhance metastasis. 
Peptides which inactivate or counteract mutated RAS oncogenes, v-MOS, v-RAF, 
A-RAF, v-SRC, v-FES, and v-FMS would also act as anti-metastatics. Peptides which 
act intracellular^ to block the release of combinations of proteases required for 
invasion, such as the matrix metalloproteases and urokinase, could also be effective 
antimetastatics. 

25 In a preferred embodiment, the random libraries of the present invention are introduced 

into tumor cells known to have inactivated tumor suppressor genes, and successful 
reversal by either reactivation or compensation of the knockout would be screened by 
restoration of the normal phenotype. A major example is the reversal of 
p53-inactivating mutations, which are present in 50% or more of all cancers. Since 

3 0 p53's actions are complex and involve its action as a transcription factor, there are 

probably numerous potential ways a peptide or small molecule derived from a peptide 
could reverse the mutation. One example would be upregulation of the immediately 
downstream cyclin-dependent kinase p21CIP1/WAF1. To be useful such reversal 
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would have to work for many of the different known p53 mutations. This is currently 
being approached by gene therapy; one or more small molecules which do this might 
be preferable. 

Another example involves screening of bioactive agents which restore the constitutive 
5 function of the brca-1 or brca-2 genes, and other tumor suppressor genes important in 

breast cancer such as the adenomatous polyposis coli gene (APC) and the Drosophila 
discs-large gene (Dig), which are components of cell-cell junctions. Mutations of brca-1 
are important in hereditary ovarian and breast cancers, and constitute an additional 
application of the present invention. 

10 In a preferred embodiment, the methods of the present invention are used to create 

novel cell lines from cancers from patients. A retrovirally delivered short peptide which 
inhibits the final common pathway of programmed cell death should allow for short- 
and possibly long-term cell lines to be established. Conditions of in vitro culture and 
infection of human leukemia cells will be established. There is a real need for methods 

15 which allow the maintenance of certain tumor cells in culture long enough to allow for 

physiological and pharmacological studies. Currently, some human cell lines have 
been established by the use of transforming agents such as Ebstein-Barr virus that 
considerably alters the existing physiology of the cell. On occasion, cells will grow on 
their own in culture but this is a random event. Programmed cell death (apoptosis) 

2 0 occurs via complex signaling pathways within cells that ultimately activate a final 

common pathway producing characteristic changes in the cell leading to a non- 
inflammatory destruction of the cell. It is well known that tumor cells have a high 
apoptotic index, or propensity to enter apoptosis in vivo. When cells are placed in 
culture, the in vivo stimuli for malignant cell growth are removed and cells readily 
25 undergo apoptosis. The objective would be to develop the technology to establish 

cell lines from any number of primary tumor cells, for example primary human 
leukemia cells, in a reproducible manner without altering the native configuration of the 
signaling pathways in these cells. By introducing nucleic acids encoding peptides 
which inhibit apoptosis, increased cell survival in vitro, and hence the opportunity to 

3 0 study signalling transduction pathways in primary human tumor cells, is accomplished. 

In addition, these methods may be used for culturing primary cells, i.e. non-tumor cells. 
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In a preferred embodiment, the present methods are useful in cardiovascular 
applications. In a preferred embodiment, cardiomyocytes may be screened for the 
prevention of cell damage or death in the presence of normally injurious conditions, 
including, but not limited to, the presence of toxic drugs (particularly chemotherapeutic 
5 drugs), for example, to prevent heart failure following treatment with adriamycin; 

anoxia, for example in the setting of coronary artery occlusion; and autoimmune 
cellular damage by attack from activated lymphoid cells (for example as seen in post 
viral myocarditis and lupus). Candidate bioactive agents are inserted into 
cardiomyocytes, the cells are subjected to the insult, and bioactive agents are selected 
10 that prevent any or all of: apoptosis; membrane depolarization (i.e. decrease 

arrythmogenic potential of insult); cell swelling; or leakage of specific intracellular ions, 
second messengers and activating molecules (for example, arachidonic acid and/or 
lysophosphatidic acid). 

In a preferred embodiment, the present methods are used to screen for diminished 
arrhythmia potential in cardiomyocytes. The screens comprise the introduction of the 
candidate nucleic acids encoding candidate bioactive agents, followed by the 
application of arrythmogenic insults, with screening for bioactive agents that block 
specific depolarization of cell membrane. This may be detected using patch clamps, or 
via fluorescence techniques). Similarly, channel activity (for example, potassium and 
chloride channels) in cardiomyocytes could be regulated using the present methods in 
order to enhance contractility and prevent or diminish arrhythmias. 

In a preferred embodiment, the present methods are used to screen for enhanced 
contractile properties of cardiomyocytes and diminish heart failure potential. The 
introduction of the libraries of the invention followed by measuring the rate of change of 
myosin polymerization/depolymerization using fluorescent techniques can be done. 
Bioactive agents which increase the rate of change of this phenomenon can result in a 
greater contractile response of the entire myocardium, similar to the effect seen with 
digitalis. 

In a preferred embodiment, the present methods are useful to identify agents that will 
regulate the intracellular and sarcolemmal calcium cycling in cardiomyocytes in order 
to prevent arrhythmias. Bioactive agents are selected that regulate sodium-calcium 
exchange, sodium proton pump function, and regulation of calcium-ATPase activity. 
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In a preferred embodiment, the present methods are useful to identify agents that 
diminish embolic phenomena in arteries and arterioles leading to strokes (and other 
occlusive events leading to kidney failure and limb ischemia) and angina precipitating a 
myocardial infarct are selected. For example, bioactive agents which will diminish the 
5 adhesion of platelets and leukocytes, and thus diminish the occlusion events. 

Adhesion in this setting can be inhibited by the libraries of the invention being inserted 
into endothelial cells (quiescent cells, or activated by cytokines, i.e. IL-1, and growth 
factors, i.e. PDGF / EGF) and then screening for peptides that either: 1) downregulate 
adhesion molecule expression on the surface of the endothelial cells (binding assay); 
10 2) block adhesion molecule activation on the surface of these cells (signaling assay); 

or 3) release in an autocrine manner peptides that block receptor binding to the 
cognate receptor on the adhering cell. 

Embolic phenomena can also be addressed by activating proteolytic enzymes on the 
cell surfaces of endothelial cells, and thus releasing active enzyme which can digest 
15 blood clots. Thus, delivery of the libraries of the invention to endothelial cells is done, 

followed by standard fluorogenic assays, which will allow monitoring of proteolytic 
activity on the cell surface towards a known substrate. Bioactive agents can then be 
selected which activate specific enzymes towards specific substrates. 

in a preferred embodiment, arterial inflammation in the setting of vasculitis and 

2 0 post-infarction can be regulated by decreasing the chemotactic responses of 

leukocytes and mononuclear leukocytes. This can be accomplished by blocking 
chemotactic receptors and their responding pathways on these cells. Candidate 
bioactive libraries can be inserted into these cells, and the chemotactic response to 
diverse chemokines (for example, to the IL-8 family of chemokines, RANTES) inhibited 
25 in cell migration assays. 

In a preferred embodiment, arterial restenosis following coronary angioplasty can be 
controlled by regulating the proliferation of vascular intimal cells and capillary and/or 
arterial endothelial cells. Candidate bioactive agent libraries can be inserted into these 
cell types and their proliferation in response to specific stimuli monitored. One 

3 0 application may be intracellular peptides which block the expression or function of 

c-myc and other oncogenes in smooth muscle cells to stop their proliferation. A 
second application may involve the expression of libraries in vascular smooth muscle 
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cells to selectively induce their apoptosis. Application of small molecules derived from 
these peptides may require targeted drug delivery; this is available with stents, 
hydrogel coatings, and infusion-based catheter systems. Peptides which downregulate 
endothelin-1 A receptors or which block the release of the potent vasoconstrictor and 
5 vascular smooth muscle cell mitogen endothelin-1 may also be candidates for 

therapeutics. Peptides can be isolated from these libraries which inhibit growth of these 
cells, or which prevent the adhesion of other cells in the circulation known to release 
autocrine growth factors, such as platelets (PDGF) and mononuclear leukocytes. 

The control of capillary and blood vessel growth is an important goal in order to 
promote increased blood flow to ischemic areas (growth), or to cut-off the blood supply 
(angiogenesis inhibition) of tumors. Candidate bioactive agent libraries can be inserted 
into capillary endothelial cells and their growth monitored. Stimuli such as low oxygen 
tension and varying degrees of angiogenic factors can regulate the responses, and 
peptides isolated that produce the appropriate phenotype. Screening for antagonism of 
vascular endothelial cell growth factor, important in angiogenesis, would also be useful. 

In a preferred embodiment, the present methods are useful in screening for decreases 
in atherosclerosis producing mechanisms to find peptides that regulate LDL and HDL 
metabolism. Candidate libraries can be inserted into the appropriate cells (including 
hepatocytes, mononuclear leukocytes, endothelial cells) and peptides selected which 
lead to a decreased release of LDL or diminished synthesis of LDL, or conversely to an 
increased release of HDL or enhanced synthesis of HDL. Bioactive agents can also be 
isolated from candidate libraries which decrease the production of oxidized LDL, which 
has been implicated in atherosclerosis and isolated from atherosclerotic lesions. This 
could occur by decreasing its expression, activating reducing systems or enzymes, or 
blocking the activity or production of enzymes implicated in production of oxidized LDL, 
such as 15-lipoxygenase in macrophages. 

In a preferred embodiment, the present methods are used in screens to regulate 
obesity via the control of food intake mechanisms or diminishing the responses of 
receptor signaling pathways that regulate metabolism. Bioactive agents that regulate or 
3 0 inhibit the responses of neuropeptide Y (NPY), cholecystokinin and galanin receptors, 

are particularly desirable. Candidate libraries can be inserted into cells that have these 
receptors cloned into them, and inhibitory peptides selected that are secreted in an 
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autocrine manner that block the signaling responses to galanin and NPY. In a similar 
manner, peptides can be found that regulate the leptin receptor. 

In a preferred embodiment, the present methods are useful in neurobiology 
applications. Candidate libraries may be used for screening for anti-apoptotics for 
5 preservation of neuronal function and prevention of neuronal death. Initial screens 

would be done in cell culture. One application would include prevention of neuronal 
death, by apoptosis, in cerebral ischemia resulting from stroke. Apoptosis is known to 
be blocked by neuronal apoptosis inhibitory protein (NAIP); screens for its 
upregulation, or effecting any coupled step could yield peptides which selectively block 
10 neuronal apoptosis. Other applications include neurodegenerative diseases such as 

Alzheimer's disease and Huntington's disease. 

In a preferred embodiment, the present methods are useful in bone biology 
applications. Osteoclasts are known to play a key role in bone remodeling by 
breaking down "old" bone, so that osteoblasts can lay down "new" bone. In 
15 osteoporosis one has an imbalance of this process. Osteoclast overactivity can be 

regulated by inserting candidate libraries into these cells, and then looking for 
bioactive agents that produce: 1) a diminished processing of collagen by these cells; 2) 
decreased pit formation on bone chips; and 3) decreased release of calcium from bone 
fragments. 

2 0 The present methods may also be used to screen for agonists of bone morphogenic 

proteins, hormone mimetics to stimulate, regulate, or enhance new bone formation (in 
a manner similar to parathyroid hormone and calcitonin, for example). These have use 
in osteoporosis, for poorly healing fractures, and to accelerate the rate of healing of 
new fractures. Furthermore, cell lines of connective tissue origin can be treated with 

2 5 candidate libraries and screened for their growth, proliferation, collagen stimulating 

activity, and/or proline incorporating ability on the target osteoblasts. Alternatively, 
candidate libraries can be expressed directly in osteoblasts or chondrocytes and 
screened for increased production of collagen or bone. 

In a preferred embodiment, the present methods are useful in skin biology applications. 

3 0 Keratinocyte responses to a variety of stimuli may result in psoriasis, a proliferative 

change in these cells. Candidate libraries can be inserted into cells removed from 
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active psoriatic plaques, and bioactive agents isolated which decrease the rate of 
growth of these cells. 

In a preferred embodiment, the present methods are useful in the regulation or 
inhibition of keloid formation (i.e. excessive scarring). Candidate libraries inserted into 
5 skin connective tissue cells isolated from individuals with this condition, and bioactive 

agents isolated that decrease proliferation, collagen formation, or proline incorporation. 
Results from this work can be extended to treat the excessive scarring that also occurs 
in burn patients. If a common peptide motif is found in the context of the keloid work, 
then it can be used widely in a topical manner to diminish scarring post burn. 

Similarly, wound healing for diabetic ulcers and other chronic "failure to heal" 
conditions in the skin and extremities can be regulated by providing additional growth 
signals to cells which populate the skin and dermal layers. Growth factor mimetics 
may in fact be very useful for this condition. Candidate libraries can be inserted into 
skin connective tissue cells, and bioactive agents isolated which promote the growth of 
these cells under "harsh" conditions, such as low oxygen tension, low pH, and the 
presence of inflammatory mediators. 

Cosmeceutical applications of the present invention include the control of melanin 
production in skin melanocytes. A naturally occurring peptide, arbutin, is a tyrosine 
hydroxylase inhibitor, a key enzyme in the synthesis of melanin. Candidate libraries 
20 can be inserted into melanocytes and known stimuli that increase the synthesis of 

melanin applied to the cells. Bioactive agents can be isolated that inhibit the synthesis 
of melanin under these conditions. 

In a preferred embodiment, the present methods are useful in endocrinology 
applications. The retroviral peptide library technology can be applied broadly to any 

25 endocrine, growth factor, cytokine or chemokine network which involves a signaling 

peptide or protein that acts in either an endocrine, paracrine or autocrine manner that 
binds or dimerizes a receptor and activates a signaling cascade that results in a known 
phenotypic or functional outcome. The methods are applied so as to isolate a peptide 
which either mimics the desired hormone (i.e., insulin, leptin, calcitonin, PDGF, EGF, 

3 0 EPO, GMCSF, IL1-1 7, mimetics) or inhibits its action by either blocking the release of 

the hormone, blocking its binding to a specific receptor or carrier protein (for example, 
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CRF binding protein), or inhibiting the intracellular responses of the specific target 
cells to that hormone. Selection of peptides which increase the expression or release 
of hormones from the cells which normally produce them could have broad 
applications to conditions of hormonal deficiency. 

5 in a preferred embodiment, the present methods are useful in infectious disease 

applications. Viral latency (herpes viruses such as CMV, EBV, HBV, and other viruses 
such as HIV) and their reactivation are a significant problem, particularly in 
immunosuppressed patients ( patients with AIDS and transplant patients). The ability 
to block the reactivation and spread of these viruses is an important goal. Cell lines 

1 0 known to harbor or be susceptible to latent viral infection can be infected with the 

specific virus, and then stimuli applied to these cells which have been shown to lead to 
reactivation and viral replication. This can be followed by measuring viral titers in the 
medium and scoring cells for phenotypic changes. Candidate libraries can then be 
inserted into these cells under the above conditions, and peptides isolated which block 

15 or diminish the growth and/or release of the virus. As with chemotherapeutics, these 

experiments can also be done with drugs which are only partially effective towards this 
outcome, and bioactive agents isolated which enhance the virucidal effect of these 
drugs. 

One example of many is the ability to block HIV-1 infection. HIV-1 requires CD4 and a 
2 0 co-receptor which can be one of several seven transmembrane G-protein coupled 

receptors. In the case of the infection of macrophages, CCR-5 is the required co- 
receptor, and there is strong evidence that a block on CCR-5 will result in resistance to 
HIV-1 infection. There are two lines of evidence for this statement. First, it is known 
that the natural ligands for CCR-5, the CC chemokines RANTES, MIP1a and MIP1b 

2 5 are responsible for CD8+ mediated resistance to HIV. Second, individuals 

homozygous for a mutant allele of CCR-5 are completely resistant to HIV infection. 
Thus, an inhibitor of the CCR-5/HIV interaction would be of enormous interest to both 
biologists and clinicians. The extracellular anchored constructs offer superb tools for 
such a discovery. Into the transmembrane, epitope tagged, glycine-serine tethered 

3 0 constructs (ssTM V G20 E TM), one can place a random, cyclized peptide library of 

the general sequence CNNNNNNNNNNC or C-(X) n -C. Then one infects a cell line that 
expresses CCR-5 with retroviruses containing this library. Using an antibody to CCR-5 
one can use FACS to sort desired cells based on the binding of this antibody to the 
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receptor. All cells which do not bind the antibody will be assumed contain inhibitors of 
this antibody binding site. These inhibitors, in the retroviral construct can be further 
assayed for their ability to inhibit HIV-1 entry. 

Viruses are known to enter cells using specific receptors to bind to cells (for example, 
5 HIV uses CD4, coronavirus uses CD13, murine leukemia virus uses transport protein, 

and measles virus usesCD44) and to fuse with cells (HIV uses chemokine receptor). 
Candidate libraries can be inserted into target cells known to be permissive to these 
viruses, and bioactive agents isolated which block the ability of these viruses to bind 
and fuse with specific target cells. 

In a preferred embodiment, the present invention finds use with infectious organisms. 
Intracellular organisms such as mycobacteria, listeria, salmonella, Pneumocystis, 
yersinia, leishmania, T. cruzi, can persist and replicate within cells, and become active 
in immunosuppressed patients. There are currently drugs on the market and in 
development which are either only partially effective or ineffective against these 
organisms. Candidate libraries can be inserted into specific cells infected with these 
organisms (pre- or post-infection), and bioactive agents selected which promote the 
intracellular destruction of these organisms in a manner analogous to intracellular 
"antibiotic peptides" similar to magainins. In addition peptides can be selected which 
enhance the cidal properties of drugs already under investigation which have 
insufficient potency by themselves, but when combined with a specific peptide from a 
candidate library, are dramatically more potent through a synergistic mechanism. 
Finally, bioactive agents can be isolated which alter the metabolism of these 
intracellular organisms, in such a way as to terminate their intracellular life cycle by 
inhibiting a key organismal event. 

2 5 Antibiotic drugs that are widely used have certain dose dependent, tissue specific 

toxicities. For example renal toxicity is seen with the use of gentamicin, tobramycin, 
and amphotericin; hepatotoxicity is seen with the use of INH and rifampin; bone 
marrow toxicity is seen with chloramphenicol; and platelet toxicity is seen with 
ticarciliin, etc. These toxicities limit their use. Candidate libraries can be introduced 

3 0 into the specific ceil types where specific changes leading to cellular damage or 

apoptosis by the antibiotics are produced, and bioactive agents can be isolated that 
confer protection, when these cells are treated with these specific antibiotics. 
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Furthermore, the present invention finds use in screening for bioactive agents that 
block antibiotic transport mechanisms. The rapid secretion from the blood stream of 
certain antibiotics limits their usefulness. For example penicillins are rapidly secreted 
by certain transport mechanisms in the kidney and choroid plexus in the brain. 
Probenecid is known to block this transport and increase serum and tissue levels. 
Candidate agents can be inserted into specific cells derived from kidney cells and cells 
of the choroid plexus known to have active transport mechanisms for antibiotics. 
Bioactive agents can then be isolated which block the active transport of specific 
antibiotics and thus extend the serum halflife of these drugs. 

In a preferred embodiment, the present methods are useful in drug toxicities and drug 
resistance applications. Drug toxicity is a significant clinical problem. This may 
manifest itself as specific tissue or cell damage with the result that the drug's 
effectiveness is limited. Examples include myeloablation in high dose cancer 
chemotherapy, damage to epithelial cells lining the airway and gut, and hair loss. 
Specific examples include adriamycin induced cardiomyocyte death, 
cisplatinin-induced kidney toxicity, vincristine-induced gut motility disorders, and 
cyclosporin-induced kidney damage. Candidate libraries can be introduced into 
specific cell types with characteristic drug-induced phenotypic or functional responses, 
in the presence of the drugs, and agents isolated which reverse or protect the specific 
cell type against the toxic changes when exposed to the drug. These effects may 
manifest as blocking the drug induced apoptosis of the cell of interest, thus initial 
screens will be for survival of the cells in the presence of high levels of drugs or 
combinations of drugs used in combination chemotherapy. 

Drug toxicity may be due to a specific metabolite produced in the liver or kidney which 
is highly toxic to specific cells, or due to drug interactions in the liver which block or 
enhance the metabolism of an administered drug. Candidate libraries can be 
introduced into liver or kidney cells following the exposure of these cells to the drug 
known to produce the toxic metabolite. Bioactive agents can be isolated which alter 
how the liver or kidney cells metabolize the drug, and specific agents identified which 
prevent the generation of a specific toxic metabolite. The generation of the metabolite 
can be followed by mass spectrometry, and phenotypic changes can be assessed by 
microscopy. Such a screen can also be done in cultured hepatocytes, cocultured with 
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readout cells which are specifically sensitive to the toxic metabolite. Applications 
include reversible (to limit toxicity) inhibitors of enzymes involved in drug metabolism. 

Multiple drug resistance, and hence tumor cell selection, outgrowth, and relapse, leads 
to morbidity and mortality in cancer patients. Candidate libraries can be introduced 
into tumor cell lines (primary and cultured) that have demonstrated specific or multiple 
drug resistance. Bioactive agents can then be identified which confer drug sensitivity 
when the cells are exposed to the drug of interest or to drugs used in combination 
chemotherapy. The readout can be the onset of apoptosis in these cells, membrane 
permeability changes, the release of intracellular ions and fluorescent markers. The 
cells in which multidrug resistance involves membrane transporters can be preloaded 
with fluorescent transporter substrates, and selection carried out for peptides which 
block the normal efflux of fluorescent drug from these cells. Candidate libraries are 
particularly suited to screening for peptides which reverse poorly characterized or 
recently discovered intracellular mechanisms of resistance or mechanisms for which 
few or no chemosensitizers currently exist, such as mechanisms involving LRP (lung 
resistance protein). This protein has been implicated in multidrug resistance in ovarian 
carcinoma, metastatic malignant melanoma, and acute myeloid leukemia. Particularly 
interesting examples include screening for agents which reverse more than one 
important resistance mechanism in a single cell, which occurs in a subset of the most 
drug resistant cells, which are also important targets. Applications would include 
screening for peptide inhibitors of both MRP (multidrug resistance related protein) and 
LRP for treatment of resistant cells in metastatic melanoma, for inhibitors of both 
p-glycoprotein and LRP in acute myeloid leukemia, and for inhibition (by any 
mechanism) of all three proteins for treating pan-resistant cells. 

In a preferred embodiment, the present methods are useful in improving the 
performance of existing or developmental drugs. First pass metabolism of orally 
administered drugs limits their oral bioavailability, and can result in diminished efficacy 
as well as the need to administer more drug for a desired effect. Reversible inhibitors 
of enzymes involved in first pass metabolism may thus be a useful adjunct enhancing 
the efficacy of these drugs. First pass metabolism occurs in the liver, thus inhibitors of 
the corresponding catabolic enzymes may enhance the effect of the cognate drugs. 
Reversible inhibitors would be delivered at the same time as, or slightly before, the 
drug of interest Screening of candidate libraries in hepatocytes for inhibitors (by any 
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mechanism, such as protein downregulation as well as a direct inhibition of activity) of 
particularly problematical isozymes would be of interest. These include the CYP3A4 
isozymes of cytochrome P450, which are involved in the first pass metabolism of the 
anti-HIV drugs saquinavir and indinavir. Other applications could include reversible 
5 inhibitors of UDP-glucuronyltransferases, sulfotransferases, N-acetyltransferases, 

epoxide hydrolases, and glutathione S-transferases, depending on the drug. Screens 
would be done in cultured hepatocytes or liver microsomes, and could involve 
antibodies recognizing the specific modification performed in the liver, or cocultured 
readout cells, if the metabolite had a different bioactivity than the untransformed drug. 
1 0 The enzymes modifying the drug would not necessarily have to be known, if screening 

was for lack of alteration of the drug. 

In a preferred embodiment, the present methods are useful in immunobiology, 
inflammation, and allergic response applications. Selective regulation of T lymphocyte 
responses is a desired goal in order to modulate immune-mediated diseases in a 
15 specific manner. Candidate libraries can be introduced into specific T cell subsets 

(TH1, TH2, CD4+, CD8+, and others) and the responses which characterize those 
subsets (cytokine generation, cytotoxicity, proliferation in response to antigen being 
presented by a mononuclear leukocyte, and others) modified by members of the 
library. Agents can be selected which increase or diminish the known T cell subset 

2 0 physiologic response. This approach will be useful in any number of conditions, 

including: 1) autoimmune diseases where one wants to induce a tolerant state (select 
a peptide that inhibits T cell subset from recognizing a self-antigen bearing cell); 2) 
allergic diseases where one wants to decrease the stimulation of IgE producing cells 
(select peptide which blocks release from T cell subsets of specific B-cell stimulating 
25 cytokines which induce switch to IgE production); 3) in transplant patients where one 

wants to induce selective immunosuppression (select peptide that diminishes 
proliferative responses of host T cells to foreign antigens); 4) in lymphoproliferative 
states where one wants to inhibit the growth or sensitize a specific T cell tumor to 
chemotherapy and/or radiation; 5) in tumor surveillance where one wants to inhibit the 

3 o killing of cytotoxic T cells by Fas ligand bearing tumor cells; and 5) in T cell mediated 

inflammatory diseases such as Rheumatoid arthritis, Connective tissue diseases 
(SLE), Multiple sclerosis, and inflammatory bowel disease, where one wants to inhibit 
the proliferation of disease-causing T cells (promote their selective apoptosis) and the 
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resulting selective destruction of target tissues (cartilage, connective tissue, 
oligodendrocytes, gut endothelial cells, respectively). 



Regulation of B cell responses will permit a more selective modulation of the type and 
amount of immunoglobulin made and secreted by specific B cell subsets. Candidate 
5 libraries can be inserted into B cells and bioactive agents selected which inhibit the 

release and synthesis of a specific immunoglobulin. This may be useful in 
autoimmune diseases characterized by the overproduction of auto antibodies and the 
production of allergy causing antibodies, such as IgE. Agents can also be identified 
which inhibit or enhance the binding of a specific immunoglobulin subclass to a 
10 specific antigen either foreign of self. Finally, agents can be selected which inhibit the 

binding of a specific immunoglobulin subclass to its receptor on specific cell types. 



Similarly, agents which affect cytokine production may be selected, generally using two 
cell systems. For example, cytokine production from macrophages, monocytes, etc. 
may be evaluated. Similarly, agents which mimic cytokines, for example erythropoetin 
15 and IL1-17, may be selected, or agents that bind cytokines such as TNF-cc, before they 

bind their receptor. 



Antigen processing by mononuclear leukocytes (ML) is an important early step in the 
immune system's ability to recognize and eliminate foreign proteins. Candidate agents 
can be inserted into ML cell lines and agents selected which alter the intracellular 

2 0 processing of foreign peptides and sequence of the foreign peptide that is presented to 

T cells by MLs on their cell surface in the context of Class II MHC. One can look for 
members of the library that enhance immune responses of a particular T ceil subset 
(for example, the peptide would in fact work as a vaccine), or look for a library member 
that binds more tightly to MHC, thus displacing naturally occurring peptides, but 

2 5 nonetheless the agent would be less immunogenic (less stimulatory to a specific T cell 

clone). This agent would in fact induce immune tolerance and/or diminish immune 
responses to foreign proteins. This approach could be used in transplantation, 
autoimmune diseases, and allergic diseases. 



30 



The release of inflammatory mediators (cytokines, leukotrienes, prostaglandins, 
platelet activating factor, histamine, neuropeptides, and other peptide and lipid 
mediators) is a key element in maintaining and amplifying aberrant immune 
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responses. Candidate libraries can be inserted into MLs, mast cells, eosinophils, and 
other cells participating in a specific inflammatory response, and bioactive agents 
selected which inhibit the synthesis, release and binding to the cognate receptor of 
each of these types of mediators. 



5 In a preferred embodiment, the present methods are useful in biotechnology 

applications. Candidate library expression in mammalian cells can also be considered 
for other pharmaceutical-related applications, such as modification of protein 
expression, protein folding, or protein secretion. One such example would be in 
commercial production of protein pharmaceuticals in CHO or other cells. Candidate 

10 libraries resulting in bioactive agents which select for an increased cell growth rate 

(perhaps peptides mimicking growth factors or acting as agonists of growth factor 
signal transduction pathways), for pathogen resistance (see previous section), for lack 
of sialylation or glycosylation (by blocking glycotransferases or rerouting trafficking of 
the protein in the cell), for allowing growth on autoclaved media, or for growth in serum 

15 free media, would all increase productivity and decrease costs in the production of 

protein pharmaceuticals. 



Random peptides displayed on the surface of circulating cells can be used as tools to 
identify organ, tissue, and cell specific peptide targeting sequences. Any ceil 
introduced into the bloodstream of an animal expressing a library targeted to the cell 
20 surface can be selected for specific organ and tissue targeting. The bioactive agent 

sequence identified can then be coupled to an antibody, enzyme, drug, imaging agent 
or substance for which organ targeting is desired. 



Other agents which may be selected using the present invention include: 1) agents 
which block the activity of transcription factors, using cell lines with reporter genes; 2) 

2 5 agents which block the interaction of two known proteins in cells, using the absence of 

normal cellular functions, the mammalian two hybrid system or fluorescence 
resonance energy transfer mechanisms for detection; and 3) agents may be identified 
by tethering a random peptide to a protein binding region to allow interactions with 
molecules sterically close, i.e. within a signalling pathway, to localize the effects to a 

3 0 functional area of interest. 
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The following examples serve to more fully describe the manner of using the above- 
described invention, as well as to set forth the best modes contemplated for carrying 
out various aspects of the invention. It is understood that these examples in no way 
serve to limit the true scope of this invention, but rather are presented for illustrative 
5 purposes. All references cited herein are incorporated by reference in their entirety. 



EXAMPLES 



Example 1 
Proof of concept experiments 



A number of systems were used to prove that the retroviral constructs outlined herein 
1 0 were able to result in a selectable phenotype. 

Bcl2 and CPP32 Protection from apoptosis 

It is known that Bcl2 and the CPP32 peptide is able to inhibit apoptosis induced by 
tumor necrosis factor and cycloheximide. 

Apotaq assay: TUNEL (TdT-mediated dUTP-fluorescein nick end labeling^ Boehringer 

15 Mannheim kit, catalog no. 168795 

3T3 cells transiently infected with either MFGLacZ, BCL2, or CPP322 plasmids were 
grown to 50% confluence at the time of induction with hTNFa (50 ng/ml media) and 
cycloheximide (100 mg/ml media) for 6 hours. Cells were washed at 6 hours and 
harvested at 24 hours after induction. Cells were harvested by pooling all media from 

2 0 cells (in order to collect any apoptotic, floating cells) with the washings and trypsinized 

cells. The cells were spun and washed with PBS containing 1% BSA, transferred to an 
eppendorf tube and the wash repeated once. 

Cells were fixed in 4% paraformaldehyde at room temperature for 30 minutes, washed 
in PBS/BSA, then resuspended in permeabilisation buffer for 2 minutes on ice. After 

2 5 permeabilisation, cells were washed twice in PBS/BSA and incubated at 37°C for 1 

hour with labeling buffer containing fluoresceinated dUTP, unlabeled nucleotide 
mixture and terminal deoxynucleotidyl transferase (TdT). Cells were washed twice 
with PBS/BSA, resuspended in PBS/BSA and transferred to a FACS tube for analysis. 
Samples were also visualized under the fluorescence microscope. The results showed 

3 0 that expression of BcI2 or the CPP32 peptide in 3T3 cells from an MSCV retroviral 
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promoter in vivo was able to inhibit apoptosis induced by tumor necrosis factor and 
cycloheximide. 

Propidium Iodide staining of fixed ce lls to assay for apoptosis: (Sherwood and 
Schimke. Methods in Ceil Biology. 46:77-87. 1995) 3T3 cells transiently infected with 
5 MFGLacZ, Bcl2, or CPP32 were plated and treated with TNF/CXH as described 

above, and harvested and washed as above. Cells were then resuspended in 70% 
ethanol in PBS at 4oC and kept at 4oC overnight. When ready to FACS, cells were 
stained with propidium iodide as follows. Cells were spun at 14,000 RPM for 10 
seconds and washed once with PBS/BSA. Cells were then resuspended in 50 ml 

10 staining solution (PBS with 50 mg/ml RNase A (DNase-free) with 10 mg/ml propidium 

iodide) and incubated at 37°C for at least 1 hour. Cells were then pelleted and 
resuspended in PBS/BSA solution containing 10 mg/ml propidium iodide and analyzed 
by FACS scanning.The results showed that expression of BCL2 or CPP32 peptide in 
3T3 cells was able to inhibit apoptosis induced by tumor necrosis factor and 

15 cycloheximide as measure by PI staining of cells, extending our previous results. 

Ethidium Bromide/Acridine Qranae Staining of BA F3 Cells to Study Cell Morphology: 
BAF3 cells were infected with WZL IRES NEO retroviral vectors containing no insert 
(WIN) or DNA coding for LacZ (ZIN), Bcl2 (BIN), CPP32 peptide (CIN), or scrambled 
peptide control (PIN). Cells were selected in G418 after infection with above retroviral 

20 vectors and survivors were stimulated with 5 mg/ml FAS antibody. After stimulation, 

cells were stained with ethidium bromide and acridine orange (2 mg/ml each) and 
visualized under the fluorescence microscope using the ultraviolet filter. 250 cells 
were counted and the percent of cells which were apoptotic were calculated. Similar to 
results obtained in 3T3 cells stimulated with TNF/CXH, the CPP32 encoding vectors 

25 are able to inhibit FAS induced apoptosis. The peptide control also had an effect in 

this system approximately half of that seen with BCL2 or the CPP32 peptide. 

Enzvmatic Assa v of CPP32 activity: CPP32 Assay Kit: Clontech CPP32 Colorimetric 
Assay Kit (Cat. No. K2027-2): 3T3 cells were infected with the vectors described in 
Part I, section C, and selected in G418 media prior to assay. 6-well plates of 3T3 cells 
3 0 at near confluence were stimulated with TNF/CXH as described above and harvested 

at 30 min, 1 , 2 and 4 hours after stimulation as follows. Cells were trypsinized and 
collected as described above. After transfering to an eppendorf tube, cells were spun 
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and resuspended in 50 ml chilled Cell Lysis Buffer. Cells were incubated for 10 
minutes on ice, then 50 ml of 2X Reaction Buffer containing DTT was added to each 
tube. 5 ml of the colorimetric conjugated substrate (DEVD-paranitroanilide, 50 mM 
final concentration) was added to each tube and incubated at 37°C for 30 minutes. 
Samples were transferred to a 96 well plate and read on a spectrophotometer at O.D. 
of 405. The results showed that cell extracts from WIN cells have increased CPP32 
enzyme activity at 2 hours as measured by cleavage of DEVD-pNA substrate to its 
colorometrically detectable form pNA. By 4 hours, cells have begun to die and the 
activity is decreased. In cells containing BCL2 or the CPP32 peptide inhibitor, this rise 
in activity is not seen. In the case of BCL2, it should be due to inhibition of apoptosis 
upstream of the enzyme. With CPP32 inhibitor peptide, it should be due to direct 
inhibition of enzymatic activity. These in vitro results are consistent with the results 
seen in cell death assays described above. 

Localization studies using PKC inhibitor 

Murine 10T1/2 Clone 8 cells were stimulated with PMA which is known to cause 
translocation of PKC from the cytoplasm to the nucleus. This translocation is thought 
to be mediated through binding to a protein at the site of action, termed a RAC 
(receptor for activated protein kinase C) protein. Uninfected clone 8 cells were 
compared to cells infected with pBabe puro retroviral constructs containing sequences 
coding for either Flu-epitope (MGGGYPYDVPDYAGSLZ) tagged scrambled peptide 
control or inhibitor peptide (GKQKTKTIKGPP) which is identical to the C2 region of all 
the PKC isozymes. We then assayed the cells by immunohistochemistry using an 
antibody specific for PKCa and visualized with a secondary antibody conjugated to 
horseradish peroxidase. 

This experiment was done at two different cell densities as follows: 

1. Cells were plated at 2,000 cells/cm 2 onto 22 mm square polylysine coated 
coverslips and allowed to grow for 2 days. On 3/20, cells were nearly confluent. Cells 
were replated at a lower density and assayed with identical conditions on 3/27. 

2. PMA was added at 1 0" 5 M to the media for 30 minutes at 37°C. 

3. Cells were rinsed with SCB buffer (physiologic buffer prewarmed to 37°C before 
use) and then placed into 3.7% glutaraldehyde in SCB buffer for 20 minutes at 37°C. 

4. Cells were then washed in SCB buffer then incubated with SCBT (SCB containing 
0.1% Triton X-100) for 10 minutes at room temperature. 



WO 97/27212 



-54- 



PCT/US97/01019 



5. Coverslips were removed from the 6-well plate and dip washed in 0.1% tween/PBS 
at room temperature and placed onto parafilm in a covered container. 

6. Coverslips were incubated with 1.5% goat blocking serum in PBS with agitation in a 
humidified environment at room temperature. 

5 7.Solutions were aspirated off the coverslip and coversiips were then washed with 

PBS. Primary anti-PKCa antibody was placed onto coversiips and incubated for 30 
minutes at room temperature as above. A 1 :500 dilution of anitbody was used in all 
experiments. 

S.Coverslips were then washed with PBS three times and then incubated for 30 
10 minutes at room temperature with biotin-conjugated second step antibody as provided 

in Santa Cruz ABC ImmunoStain Sytems kit. Coversiips were then washed three 
times with PBS. 

9. Coversiips were then incubated in avidin biotin enzyme reagent (as supplied with kit) 
for 30 minutes at room temperature. Coversiips were then washed for 10 minutes in 

15 PBS after being placed back into 6-well plates. 

10. Coversiips were rinsed with 0.5% Triton X-100/PBS for 30 seconds and incubated 
in DAB solution for 5 minutes. Reaction was stopped by addition of distilled water to 
well. 

11. Coversiips were then dehydrated through alcohols and xylene and mounted onto 
20 slides with Permount and visualised and photographed by light microscopy. 

The result showed that basically, control clone 8 cells showed predominantly 
cytoplasmic and perinuclear staining, while PMA induced cells consistantly showed 
translocation to the nucleus. Cells infected with constructs coding for the scrambled 
peptide showed similar staining. Cells infected with constructs coding for peptides 

25 identical to the C2 region of PKC showed predominantly cytoplasmic and perinuclear 

staining in both control and PMA induced cells suggesting that this peptide is able to 
specifically inhibit translocation of activated PKCa to its RAC protein upon stimulation 
of the cells with PMA. It is also possible using similarly infected cells to see the 
downstream results of peptide expression upon gene activity. Ceils were infected with 

3 0 retroviruses expressing either the PKCb2.1 , PKC2.1 peptide, a dominant negative ras 

protein control, combinations of these viruses, or no virus at all. Cells were stimulated 
with PMA at 100 ng/ml, PDGF-AA, or PDGF-BB. mRNA was prepared and northern 
blots were performed for fos gene expression (induced by PKC activation) or the 
ribosomal protein P0, a loading control mRNA whose expression is not known to be 

3 5 acted upon by signaling systems induced by PKC. The PKC peptides can markedly 
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reduce expression of the fos gene mRNA. Indeed, an unexpected result was that 
under certain conditions there is activation of the mRNA expression. This latter results 
confirms that novel outcomes can occur upon expression of peptides within cells. 

Example 2 

5 pBabe Puro Retroviral Libraries and Apoptosis 

A series of retroviral constructs have been designed for expression of randomized and 
biased peptides within target cell populations. The peptide is expressed from a 
retroviral promoter. The translation unit has several important components. Glycine 
following the initiator methionine at the amino terminus stabilizes the peptide and 
enhances cytoplasmic half-life, according to Varshavsky's N-End Rule, in some 
constructs, a nine amino acid flu epitope tag has been incorporated to permit 
co-precipitation of the rare peptide and any molecule to which it has affinity, by using 
monoclonal antibodies to the epitope. Glycines are encoded before and after the 
random/biased expression product encoding regions to provide some molecular 
flexibility. Two carboxyl-terminal prolines are encoded to confer stability to 
carboxy peptidase. 

For construction of a large library two primers were made (schematized in Figure 1). 
The first, designated the random peptide primer, consists of 1 ) a complementary 
region for vector priming, 2) the regions mentioned above, and 3) a random or biased 
2 0 expression product region, were presented as a 30 base sequence encoding a peptide 

of length 10 amino acids. In addition, we have inserted a stop codon in all three 
reading frames in case of minor deletions or insertions in the random region. The 
design of the primer ensures a glycine/proline termination in most reading frames. The 
second primer is downstream in the vector and primes a region of the plasmid that 

2 5 contains a unique Not I site. These primers are used to create a library of fragments, 

each containing a different nucleotide sequence that each potentially encodes a 
different peptide. These families of fragments are ligated to vector fragments 
containing puromycin selection sequence, a 3*LTR, and a bacterial origin of replication. 
The ligation products are then electroporated into E. coli and DNA is prepared from the 

3 0 resulting library. Using this technique, we have constructed independent random 

libraries with up to 2 x 10 s unique inserts. Sequencing multiple individual inserts 
demonstrates they have the structure as defined by Primer 1 , and the peptides 
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encoded are random. Such libraries thus made contain subsets of the total 10 13 
predicted peptides. 

Generation of Retroviral P eptide Libraries. 

A scheme for generating a peptide library in the pBabe Puro vector is shown in Figure 
5 2. Primers for PCR were synthesized, purified and deprotected according to standard 

protocols. Primer 1, complementary to poly tinker sequences in the pBabe Puro 
retroviral construct, has the sequence 5' GCT TAG CAA GAT CTC TAC GGT GGA 
CCK NNK NNK NNK NNK NNK NNK NNK NNK NNK NNC CCC ACT CCC ATG GTC 
CTA CGT ACC ACC ACA CTG GG 3\ N represents any of the four bases; K is limited 

10 to G or T. Primer 2 has the sequence 5' GCT TAG CAA GAT CTG TGT GTC AGT 

TAG GGT GTG G 3* and is complementary to sequences within the pUC18 origin of 
replication. PCR was carried out for 8 rounds using primer 1 , primer 2, Babe Puro as 
template, and a mixture of Taq DNA Polymerase (Promega) and Deep Vent DNA 
Polymerase (New England Biolabs) in a ratio of 128 Taq: 1 Deep Vent as described in 

15 Barnes (1994) Proc. Natl. Acad. Sci. USA, 91, pp. 2216-2220. The amplified PCR 

product was purified, digested with restriction enzymes Bgl II and Not I (Promega), 
purified again and ligated with the corresponding Bam Hl-Not I fragment of pBabe 
Puro. After transformation the resulting library contained -2x1 0 8 clones, greater than 
80% of which contained inserts. 

20 oMSCV-PC and oBabeMN -PC retroviral construct libraries: 

Oligonucleotides were synthesized and purified according to standard protocols. The 
"library" oligonucleotides have the sequence 5' CTG GAG AAC CAG GAC CAT GGG 
C (NNK)1 0 GGG CCC CCT TAA ACC ATT AAA T 3' or 5' CTG GAG AAC CAG GAC 
CAT GGG CNN KNN KNN KCC TCC CNN KCC TNN KNN KGG GCC CCC TTA AAC 

2 5 CAT TAA AT 3'. A third oligonucleotide ("constant"), complementary to the 3' ends of 

the library oligonucleotides, has the sequence 5TCA TGC ATC CAA TTT AAT GGT 
TTA AG 3'. As shown in Fig. 3, each library oligonucleotide is annealed to the 
constant oligonucleotide, converted to double stranded DNA with Sequenase (United 
States Biochemical) or Klenow (Promega), digested with restriction enzyme Bst XI 

3 0 (New England Biolabs), and purified and ligated with the appropriate Bst Xl-digested 

retroviral construct. Transformation efficiencies are - 2 x 10 8 clones per microgram of 
ligated DNA, greater than 90% of which contain an insert. A representative retrovirus 
is shown in Fig. 4; see also, retroviral nucleotide sequence below: 
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Retroviral vector with presentation construct. 

TGAAAGACCCCACCTGTAGGTTTGGCAAGCTAGCTTAAGTAACGCCATTTTGCAA 
GGCATGGAAAATACATAACTGAGAATAGAGAAGTTCAGATCAAGGTTAGGAACAG 
AGAGACAGCAGAATATGGGCCAAACAGGATATCTGTGGTAAGCAGTTCCTGCCC 
5 CGGCTCAGGGCCAAGAACAGATGGTCCCCAGATGCGGTCCCGCCCTCAGCAGT 
TTCTAGAGAACCATCAGATGTTTCCAGGGTGCCCCAAGGACCTGAAAATGACCCT 
GTGCCTTATTTGAACTAACCAATCAGTTCGCTTCTCGCTTCTGTTCGCGCGCTTCT 
GCTCCCCGAGCTCAATAAAAGAGCCCACAACCCCTCACTCGGCGCGCCAGTCCT 
CCGATAGACTGCGTCGCCCGGGTACCCGTATTCCCAATAAAGCCTCTTGCTGTTT 

1 0 GCATCCGAATCGTGGACTCGCTGATCCTTGGGAGGGTCTCCTCAGATTGATTGA 
CTGCCCACCTCGGGGGTCTTTCATTTGGAGGTTCCACCGAGATTTGGAGACCCC 
TGCCTAGGGACCACCGACCCCCCCGCCGGGAGGTAAGCTGGCCAGCGGTCGTT 
TCCTGTCTGTCTCTGTCTTTGTGCGTGTTTGTGCCGGCATCTAATGTTTGCGCCT 
GCGTCTGTACTAGTTAGCTAACTAGCTCTGTATCTGGCGGACCCGTGGTGGAACT 

1 5 GACGAGTTCTGAACACCCGGCCGCAACCCTGGGAGACGTCCCAGGGACTTTGG 
GGGCCGTTTTTGTGGCCCGACCTGAGGAAGGGAGTCGATGTGGAATCCGACCC 
CGTCAGGATATGTGGTTCTGGTAGGAGACGAGAACCTAAAACAGTTCCCGCCTC 
CGTCTGAATTTTTGCTTTCGGTTTGGAACCGAAGCCGCGCGTCTTGTCTGCTGCA 
GCGCTGCAGCATCGTTCTGTGTTCTCTCTGTCTGACTGTGTTTCTGTATTTGTCTG 

2 0 AAAATTAGGGCCAGACTGTTACCACTCCCTTAAGTTTGACCTTAGGTCACTGGAA 
AGATGTCGAGCGGATCGCTCACAACCAGTCGGTAGATGTCAAGAAGAGACGTTG 
GGTTACCTTCTGCTCTGCAGAATGGCCAACCTTTAACGTCGGATGGCCGCGAGA 
CGGCACCTTTAACCGAGACCTCATCACCCAGGTTAAGATCAAGGTCTTTTCACCT 
GGCCCGCATGGACACCCAGACCAGGTCCCCTACATCGTGACCTGGGAAGCCTTG 

2 5 GCTTTTGACCCCCCTCCCTGGGTCAAGCCCTTTGTACACCCTAAGCCTCCGCCTC 

CTCTTCCTCCATCCGCCCCGTCTCTCCCCCTTGAACCTCCTCGTTCGACCCCGCC 
TCGATCCTCCCTTTATCCAGCCCTCACTCCTTCTCTAGGCGCCGGAATTCCAGGA 
CCATGGGCGGGCCCCCTTAAACCATTAAATTGGTAAAATAAAGGATCCGTCGACC 
TGCAGCCAAGCTTATCGATAAAATAAAAGATTTTATTTAGTCTCCAGAAAAAGGGG 

3 0 GGAATGAAAGACCCCACCTGTAGGTTTGGCAAGCTAGCTTAAGTAACGCCATTTT 

GCAAGGCATGGAAAATACATAACTGAGAATAGAGAAGTTCAGATCAAGGTTAGGA 
ACAGAGAGACAGCAGAATATGGGCCAAACAGGATATCTGTGGTAAGCAGTTCCT 
GCCCCGGCTCAGGGCCAAGAACAGATGGTCCCCAGATGCGGTCCCGCCCTCAG 
CAGTTTCTAGAGAACCATCAGATGTTTCCAGGGTGCCCCAAGGACCTGAAAATGA 
3 5 CCCTGTGCCTTATTTGAACTAACCAATCAGTTCGCTTCTCGCTTCTGTTCGCGCG 
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CTTCTGCTCCCCGAGCTCAATAAAAGAGCCCACAACCCCTCACTCGGCGCGCCA 
GTCCTCCGATAGACTGCGTCGCCCGGGTACCCGTGTATCCAATAAACCCTCTTG 
CAGTTGCATCCGACTTGTGGTCTCGCTGTTCCTTGGGAGGGTCTCCTCTGAGTG 
ATTGACTACCCGTCAGCGGGGGTCTTTCATTCGTAATCATGGTCATAGCTGTTTC 
5 CTGTGTGAAATTGTTATCCGCTCACAATTCCACACAACATACGAGCCGGAAGCAT 
AAAGTGTAAAGCCTGGGGTGCCTAATGAGTGAGCTAACTCACATTAATTGCGTTG 
CGCTCACTGCCCGCTTTCCAGTCGGGAAACCTGTCGTGCCAGCTGCATTAATGA 
ATCGGCCAACGCGCGGGGAGAGGCGGTTTGCGTATTGGGCGCTCTTCCGCTTC 
CTCGCTCACTGACTCGCTGCGCTCGGTCGTTCGGCTGCGGCGAGCGGTATCAG 

1 0 CTCACTCAAAGGCGGTAATACGGTTATCC ACAGAATCAGGGGATAACGCAGGAAA 

GAACATGTGAGCAAAAGGCCAGCAAAAGGCCAGGAACCGTAAAAAGGCCGCGTT 
GCTGGCG 11 n I CCATAGGCTCCGCCCCCCTGACGAGCATCACAAAAATCGACG 
CTCAAGTCAGAGGTGGCGAAACCCGACAGGACTATAAAGATACCAGGCGTTTCC 
CCCTGGAAGCTCCCTCGTGCGCTCTCCTGTTCCGACCCTGCCGCTTACCGGATA 

15 CCTGTCCGCCTTTCTCCCTTCGGGAAGCGTGGCGCTTTCTCATAGCTCACGCTGT 
AGGTATCTCAGTTCGGTGTAGGTCGTTCGCTCCAAGCTGGGCTGTGTGCACGAA 
CCCCCCGTTCAGCCCGACCGCTGCGCCTTATCCGGTAACTATCGTCTTGAGTCC 
AACCCGGTAAGACACGACTTATCGCCACTGGCAGCAGCCACTGGTAACAGGATT 
AGCAGAGCGAGGTATGTAGGCGGTGCTACAGAGTTCTTGAAGTGGTGGCCTAAC 

2 0 TACGGCTACACTAGAAGGACAGTATTTGGTATCTGCGCTCTGCTGAAGCCAGTTA 
CCTTCGGAAAAAGAGTTGGTAGCTCTTGATCCGGCAAACAAACCACCGCTGGTAG 
CGGTGG I I 1 1 I 1 I GTTTGCAAGCAGCAGATTACGCGCAGAAAAAAAGGATCTCAA 
GAAGATCCTTTGATCTTTTCTACGGGGTCTGACGCTCAGTGGAACGAAAACTCAC 
GTTAAGGGATTTTGGTCATGAGATTATCAAAAAGGATCTTCACCTAGATCCTTTTA 

2 5 AATTAAAAATGAAGTTTTAAATCAATCTAAAGTATATATGAGTAAACTTGGTCTGAC 

AGTTACCAATGCTTAATCAGTGAGGCACCTATCTCAGCGATCTGTCTATTTCGTTC 
ATCCATAGTTGCCTGACTCCCCGTCGTGTAGATAACTACGATACGGGAGGGCTTA 
CCATCTGGCCCCAGTGCTGCAATGATACCGCGAGACCCACGCTCACCGGCTCCA 
GATTTATCAGCAATAAACCAGCCAGCCGGAAGGGCCGAGCGCAGAAGTGGTCCT 

3 0 GCAACTTTATCCGCCTCCATCCAGTCTATTAATTGTTGCCGGGAAGCTAGAGTAA 

GTAGTTCGCCAGTTAATAGTTTGCGCAACGTTGTTGCCATTGCTACAGGCATCGT 
GGTGTCACGCTCGTCGTTTGGTATGGCTTCATTCAGCTCCGGTTCCCAACGATCA 
AGGCGAGTTACATGATCCCCCATGTTGTGCAAAAAAGCGGTTAGCTCCTTCGGTC 
CTCCGATCGTTGTCAGAAGTAAGTTGGCCGCAGTGTTATCACTCATGGTTATGGC 
3 5 AGCACTGCATAATTCTCTTACTGTCATGCCATCCGTAAGATGCTTTTCTGTGACTG 
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GTGAGTACTCAACCAAGTCATTCTGAGAATAGTGTATGCGGCGACCGAGTTGCTC 
TTGCCCGGCGTCAATACGGGATAATACCGCGCCACATAGCAGAACTTTAAAAGTG 
CTCATCATTGGAAAACGTTCTTCGGGGCGAAAACTCTCAAGGATCTTACCGCTGT 
TGAGATCCAGTTCGATGTAACCCACTCGTGCACCCAACTGATCTTCAGCATCTTTT 
5 ACTTTCACCAGCGTTTCTGGGTGAGCAAAAACAGGAAGGCAAAATGCCGCAAAAA 
AGGGAATAAGGGCGACACGGAAATGTTGAATACTCATACTCTTCCTTTTTCAATAT 
TATTGAAGCATTTATCAGGGTTATTGTCTCATGAGCGGATACATATTTGAATGTAT 
TTAGAAAAATAAACAAATAGGGGTTCCGCGCACATTTCCCCGAAAAGTGCCACCT 
GACGTCTAAGAAACCATTATTATCATGACATTAACCTATAAAAATAGGCGTATCAC 
0 GAGGCCCTTTCGTCTCGCGCGTTTCGGTGATGACGGTGAAAACCTCTGACACAT 
GCAGCTCCCGGAGACGGTCACAGCTTGTCTGTAAGCGGATGCCGGGAGCAGAC 
AAGCCCGTCAGGGCGCGTCAGCGGGTGTTGGCGGGTGTCGGGGCTGGCTTAAC 
TATGCGGCATCAGAGCAGATTGTACTGAGAGTGCACCATATGCGGTGTGAAATAC 
CGCACAGATGCGTAAGGAGAAAATACCGCATCAGGCGCCATTCGCCATTCAGGC 
TGCGCAACTGTTGGGAAGGGCGATCGGTGCGGGCCTCTTCGCTATTACGCCAGC 
TGGCGAAAGGGGGATGTGCTGCAAGGCGATTAAGTTGGGTAACGCCAGGGTTTT 
CCCAGTCACGACGTTGTAAAACGACGGCCAGTGCCACGCTCTCCCTTATGCGAC 
TCCTGCATTAGGAAGCAGCCCAGTAGTAGGTTGAGGCCGTTGAGCACCGCCGCC 
GCAAGGAATGGTGCATGCAAGGAGATGGCGCCCAACAGTCCCCCGGCCACGGG 
GCCTGCCACCATACCCACGCCGAAACAAGCGCTCATGAGCCCGAAGTGGCGAG 
CCCGATCTTCCCCATCGGTGATGTCGGCGATATAGGCGCCAGCAACCGCACCTG 
TGGCGCCGGTGATGCCGGCCACGATGCGTCCGGCGTAGAG 

Peptide Library Infection of a Factor-depen dent Line and Outgrowth of an 
ADOPtosis-Resistant Line 

The Baf/3 cell line is an lt-3 dependent cell that undergoes rapid apoptosis in the 
absence af IL-3. Thus it makes an attractive cell line for dominant effector peptides. 
Cells expressing a peptide that inhibits apoptosis are readily selected against the 
background of dying cells. We chose this cell line as a model for demonstrating 
peptide selection. 

A retroviral library containing 5 x 10 s independent peptide inserts was transfected into 
BOSC23 cells and converted into retrovirus with an approximate titer of 5 x 10 s per ml. 
Twelve ml of viral supernatant was used to infect 6 x 10 6 Baf/3cells (2 ml per infection 
of 1 x 10 6 cells in independent infections). Cells were grown for 3 days after infection 
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in the presence of IL-3 to allow retroviral integration and peptide expression. After 
three days IL-3 was withdrawn and the cells allowed to grow for two weeks. After two 
weeks, one well of six had outgrowth of cells that survive in the absence of IL-3, 
indicating the presence of an apoptosis-inhibiting peptide. Peptides derived in this 
5 manner may effect the IL-3 independence by positive dominancy (i.e., mimic or 

circumvent the positive regulatory role of IL-3) or by inhibition (i.e., prevent the 
apoptosis process upon IL-3 withdrawal). 

Example 3 

pMSCVpc Vector Construction and Apoptosis 

10 The retroviral vector pMSCVpc was prepared by cloning an insert containing 

sequences encoding a Kozak translation initiation sequence, BstXI sites for cloning 
library inserts, Nrul and Xhol sites and stop codons in all three reading frames, into the 
EcoRI and BamHI sites of pMSCV neo. 

BstX I Restriction Digestion 
15 200 |jg pMSCVpc vector DNA was combined with 40 pl 1 0X NEBuffer 3 and 

30 Ml BstX I (10 units/pl) in a total volume of 400 \sl The sample was incubated 
overnight at 55 °C, phenol extracted, and digested with Xhol, and purified on a 
potassium acetate step gradient using 10, 15, 20 and 25% solutions of potassium 
acetate. The DNA was precipitated, with a recovery of 40%. 

20 Library Insert Preparation 

Oligonucleotide Synthesis 

Oligonucleotides (OL) with the following sequences were synthesized: 
OL-1: 5'-CTG GAG AAC CAG GAC C AT GGG CAA GAG AAA GGG CGA TGA GGT 
GGA TGG AGT GGG GCC CCC TT A AAC CAT TAA AT -3' 
2 5 The underlined region encodes a peptide with the sequence MGKRKGDEVDGVGPP. 

This peptide was shown to inhibit Fas-mediated and Staurosporin induced apoptosis 
when expressed in cells with a retrovirus. 
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OL-2: 5'-CTG GAG AAC CAG GAC CAT GGG CAA GAG AAA GGG CNN KNN KNN 
KGA KNN KGT GGG GCC CCC TTA AAC CAT TAA AT -3' 
Variable region: N = A,C,G,T (equimolar) K = G,T (equimolar) 
Limiting the K position of each codon to G or T reduces stop codon generation and 
5 codon usage bias. The underlined region encodes a randomized peptide with the 

sequence MGKRKGXXXD/EXVGPP. 

OL-3: 5'-TCA TGC ATC CAA TTT AAT GGT TTA AG-3' 

The 15 3-bases of OL-3 are complementary to the 15 3-bases of OL-1 and OL-2. 

OL-1 and OL-2 were synthesized at 1pM scale, while OL-3 was synthesized at 
1 0 standard 40nM scale. All of the oligos were synthesized with trityl-on, deprotected and 

purified on OPC columns according to the manufacturer's directions (Applied 
Biosystems). Each oligo was resuspended in 200pl 10mM Tris pH 8.5 without EDTA. 
The DNA concentration was determined by measuring the absorbance at 260 nm. 

PCR was done with 50 pmole of either OL-1 or OL-2 and 50 pmole of OL-3. Phenol 
15 extraction and ethanol precipitation was done t and the resulting DNA was run on a 

10% nondenaturing 10% acrylamide gel, with ethidium bromide staining. 

The samples were quantitated, ligated, precipitated and electroporated into 
electrocompetent TOP10P E. Coli (Invitrogen) using standard techniques (see Current 
Protocols in Molecular Biology, section 1.8.4). A test transformation yielded 5 x 10 9 
transformants per pg of pUC DNA. After transformation, the transformation efficiency 
was determined by plating dilutions onto LB-amp plates (100 pg/ml ampicillin) and 
counting surviving colonies. For the library insert generated from OL-2, a 4:1 
insertvector molar ration in the ligation gave a transformation efficiency of 3.98 x 10 7 
transformants per M9 vector DNA used in the ligation, with a large scale transformation 
efficiency of 4.8 X 10 7 transformants per ug vector. The vector alone ligation 
generated 40 fold fewer transformants. 1 0 colonies from the transformation with the 
OL-1 insert ligation were picked, cultured and the DNA prepared and sequenced to 
identify the correct clone. 

The remainder of the OL-2 library SOC/transfbrmation mixture was inoculated into 
30 500mI LB-amp (100 pg/ml ampicillin) and incubated at 37 °C with shaking (300 rpm). 
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The Abs 600 of the library culture was monitored. When the culture reached an Abs 600 of 
0.8 (approximately five hours), 100 pi were removed, pelleted, resuspended in 10ml 
LB/15% glycerol and stored in 1 ml aliquots at -80 °C (An AbSgoo of 0.8 equals a cell 
concentration of approximately 10 9 cells per ml. Therefore, for a library of 4.8 x 10 7 , 
5 each frozen aliquot will contain 200 library equivalents). 

Analysis of library diversity 

Surviving colonies plated above were screened by PCR with primers flanking the 
degenerate region to determine the fraction of clones which contained insert (>90%). 8 
insert-containing clones were picked and the nucleotide sequences of the degenerate 
10 and flanking non-degenerate regions determined. Each nucleotide was represented in 

the N positions with approximately 25% frequency, while G or T (but not A or C) was 
represented in the K positions with approximately 50% frequency. The frequency of 
stop codons generated in the degenerate region can be determined by this method as 
well. 

15 Generation o f library retrovirus and infection of Jurkat cells. 

DAY 0: Preparation of Phoenix Retrovirus Producer cells for Transfection: 
18-24 hours prior to transfection, Phoenix cells were evenly plated at 1.5-2 million 
cells per 60 mm plate in Producer cell growth media (DMEM: 10% FCS, 1% Penicillin- 
Streptomycin, 1% Glutamine). Cells were allowed to attach for 20 hours on the plates. 

2 0 DAY 1 : Transient transfection: The highest transfection frequencies are obtained with 

Phoenix cells that are 70-80% confluent at the time of transfection. The DNA in HBS 
(2XHBS = 8.0 g NaCI, 6.5 g HEPES, 10 ml Na 2 HP0 4 stock (5.25 g dibasic in 500 ml 
water), adjusted to pH 7, to a final volume of 500 mis, with a final pH adjustment to 7) 
was prepared for application to the Phoenix cells. About 5 minutes prior to 

2 5 transfection, chloroquine (Sigma) was added to each plate to 25 uM (chloroquine stock 

is 50 mM in ddH20; for 3 mL media + 1 ml DNA, add 2 p1). To a 15 ml conical tube, 
the following were added (per 6 cm plate, 5 plates total, with all reagents at room 
temperature): 

5 ug library DNA (DNA was added in a drop to side of tube) 

3 0 1 ug pMSCVpc lacZ virus vector 

438 u1 dd H20 (the DNA was washed to the bottom of tube with water). 
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61 ul 2M CaC1 2 (Mallinkrodt, catalog # 4160; make up in water, sterile filter and store 
tightly capped at4°C. 
500 ul Total volume. 

Samples were mixed thoroughly with finger tapping. Transfections with 5 ug pMSCVpc 
5 lac Z and with the OL-1 vector DNA were carried out for use as negative and positive 

controls, respectively.O.SmL 2xHBS was added to each tube quickly; the solution was 
bubbled vigorously with the automatic pipettor by keeping the eject button depressed) 
for 10 sec (the actual length of bubbling time depends on each batch of 2xHBS). The 
HBS/DNA solution was dispersed dropwise and evenly onto the media in each Phoenix 

10 cell plate dropwise (gently and quickly). The plates were observed under a 

microscope; evenly distributed very small black particles of precipitated DNA (like 
pepper) were visible. The plates were placed in a 37 oC incubator and rocked forward 
and backward a few times to evenly distribute the DNA/CaP04 particles. 6-8 hours 
post-transfection, the media was changed to 3 ml fresh DMEM t 10% FCS. Prior to the 

15 media change, the DNA precipitate was larger and more clearly visible under the 

microscope. 

DAY 2: Second media change. 

24 hours post-transfection, the media was changed again to 3 ml fresh DMEM, 10% 
FCS. The cells were placed at 32°C (the virus is more stable if incubation is carried 
2 0 out at 32°C, although 37°C is fine). 



DAY 3: Transduction of Jurkat EcoR cells. 

A sterile Acrodisc 0.45 micron syringe filter (Gelman Sciences) was attached to the 
end of a 10 ml sterile syringe and the injection stopper sterilly removed from the 
syringe barrel. At 48 hours post-transfection, the virus supernatant was removed from 

2 5 the Phoenix cells and added to the syringe barrel. The stopper was replaced and the 

virus supernatant was ejected dropwise into a clean, sterile conical tube. The Phoenix 
cell plates were set aside for X-Gal staining (see below). Polybrene was added to each 
viral supernatant (Sigma; 2.5 mg/ml in ddH20 = 500X; store at -20 oC) to a final 
concentration of 5 mg/ml. 4.5 x 10 6 Jurkat EcoR cells (Jurkat cells stably expressing 

30 the ecotropic retrovirus receptor) were pelleted for 1400 rpm for five min and 

resuspended in 9 mis of the OL-2 library virus supernatant. The cells were distributed 
in aliquots of 1 ml, or 5 x 10 5 cells, into the wells of a 24 well plate. 1 .5 x 10 6 Jurkat 
EcoR cells were similarly treated with 3mls each of the lacZ viral supernatant and the 
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OL-1 viral supernatant. Each cell plate was wrapped in parafilm, placed in a microplate 
carrier (DuPont) and centrifuged at 2500 rpm for 90 min at 32°C in a DuPont/Sorvall 
RT 6000B table top centrifuge. After centrifugation, the cells were observed under a 
microscope. The presence of large irregularly-shaped bodies representing fused 
5 Jurkats (each as large as 5-10 unfused cells) suggested successful infection. The 

parafilm was removed from the plates, which were placed at 32°C. After an additional 
16 hours at 32°C, the cells were loosened from the bottom of each well with gentle 
trituration and added to a 15 ml conical tube. The tubes were centrifuged at 1400 rpm 
for five min to the pellet the cells. The cells were resuspended in 5 mis fresh 

10 RPMI,10% FCS for every three wells of cells and added to a 60 mm plate (3 wells of 

cells per plate). 1 ml fresh RPMI, 10% FCS was added to each well of cells remaining 
in the 24 well plates. Plates were kept at 37°C for 72 hours, at which time the cells 
transduced with each virus were combined and an aliquot Jurkat cells stained with X- 
Gal. Unused viral supernatant was stored at -80°C for future transduction, although the 

1 5 titer drops by one-half for each freeze-thaw cycle. 

Petermingtipn pf tren$fgptipn efficiency 

Both tranfected Phoenix cells and transduced Jurkat cells were stained with X-Gal to 
gauge the transfection and transduction efficiencies. The purpose of co-transfecting 
the pMSCVpc lacZ virus vector with the library virus vector, as described above, was 

2 0 to permit an indirect assessment of the efficiencies of transfection and 

transduction. Preparation of solutions: fixative: PBS/0.10% Glutaraldehyde. 
Glutaraldehyde stock (Sigma cat # G5882) is a 25% solution, or 250X; stock staining 
solutions: i) 300mM/25X ferrocyanate solution: 25.3 g K4Fe(CN)6.3H20 (Mallinckrodt) 
+ 2.48 g MgCI2 (Sigma) in 200 ml H20; store at 4°C; ii. 300mM/25X ferricyanate 

25 solution: 19.75 g K3Fe(CN)6 (Sigma) + 2.48 g MgCl2 in 200 ml H20; store at 4°; iii. 

XGal (Molecular Probes) is made up as a 40mg/ml solution in DMF; store at -20°C in 
the dark; iv. 1X ferro/ferricyanate solution: add 4 ml 300mM/25x ferrocyanate solution 
and 4 ml 300mM/25X ferricyanate solution to 196 ml PBS; store at 4°C for up to one 
month; v. active staining solution: each time cells are to be stained, 100 pi 40mg/ml X- 

30 Gal is added to each 3 ml 1X ferro/ferricyanate solution; washing solution: PBS for 

Phoenix and other adherent cells; 1% FCS in PBS for Jurkat and other nonadherent 
cells. 
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The media was removed from the 60 mm plates of Phoenix cells or 5 x 1 0 5 Jurkat 
cells were pelleted in a 15 ml conical tube at 1400 rpm for five min. 2 ml of fixative 
were added to each 60 mm plate of Phoenix cells or Jurkat cells were resuspended in 
1 ml fixative. Cells were left in fixative for 2 min. For Phoenix cells, fixative was poured 
5 off and the cells were washed three times with PBS (first two washes were quick; for 

third wash, the PBS was left on the cells for 3 min). For Jurkat cells, the fixative was 
quenched by adding 5-IO ml PBS/1% FCS to each conical tube, inverting each tube 
five times and pelleting as before.3 ml of active staining solution were layered onto 
each 60 mm plate of Phoenix cells or each cell pellet of 5 x 10 5 Jurkat cells was 
10 resuspended in 1 ml of active staining solution and placed in a well of a 24 well plate. 

All cells were incubated at 37°C. The cells were observed under a microscope 24 
hours later. The efficiency of transfection of the Phoenix cells was estimated as the 
percentage of blue cells in a field. The efficiency of transduction of the Jurkat cells 
was estimated by counting blue cells in a hemocytometer. Transfection with 5 pg lacZ 
15 vector produced 50% blue Phoenix cells. Transduction of Jurkats with the resulting 

virus produced 30% blue Jurkat cells. Co-transfection of 1pg lacZ virus vector with 5 
|jg library virus vector produced 5-10% blue Phoenix cells. Transduction of Jurkats 
with the resulting virus resulted in 3-10% blue Jurkat cells. 

Selection of Jurkat cells with IqM anti-Fas. 

Titer IgM anti-Fas: A fresh batch of CH-1 1 IgM antibody to human Fas (Kamiya 
Biomedical Company; cat # MC-060)was tested to determine the effectiveness of 
induction of apoptosis. 5 x 10 5 Jurkat EcoR cells were pelleted at 1500 rpm for five 
min and resuspended in 1 ml RPMI/2.5% FCS plus serial dilutions of CH-1 1 antibody, 
50 ng/ml, 10 ng/ml, 2.0 ng/ml and 0.5 ng/ml final concentration. Cells in each dilution 
of antibody were placed in a well of a 24 well plate at 37°C for 48 hours, at which time 
4 ml acridine orange/ethidium bromide (Sigma; 100 pg/ml each in PBS; store in the 
dark at 4°C) was added to 100 ml cells on ice. Cells were examined in a 
hemocytometer under a 20 x objective with a filter combination suitable for reading 
fluorescein. 

30 2. 1 00 cells from each sample were counted and the number of cells in the following 

groups was recorded: 

1 . live ceils with normal nuclei (bright green chromatin with organized structure). 

2. early apoptotic (EA; bright green chromatin that is highly condensed or fragmented). 
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3. late apoptotic (LA; bright orange chromatin that is highly condensed or fragmented). 

4. necrotic cells (N; bright orange chromatin with organized structure). 

% apoptotic cells was calculated as EA + LA/total number of cells counted x 100 
Using 10 ng/ml of the CH-11 antibody, > 95% apoptosis of Jurkat EcoR cells was 
5 demonstrated. 

IgM anti-Fas selection of library-expressing Jurkats, 

9.6 x 10 6 OL-2 library-transduced Jurkat cells were pelleted and resuspended in 96 ml 
RPMI/2.5% FCS + 10 ng/ml CH-11 antibody. Cells were distributed in 1 ml aliquots of 1 
x 10 5 cells into each well of four 24 well plates. 4.8 x 10 6 lacZ-transduced Jurkats and 

10 OL-1-transduced Jurkats were similarly treated and each distributed into the wells of 

two 24 well plates. Plates were placed at 37°C for five days. The plates were checked 
daily for bacterial or yeast contamination. Cells were removed from any contaminated 
wells and 2 ml 1 0N NaOH was added to the empty wells to reduce the risk of spread of 
contamination to other wells. Little to no live cells were observed under the microscope 

1 5 after 2-3 days, confirmed by the red color of the media which had not been depleted of 

any nutrients. Five days after initial IgM anti-Fas treatment, 1 ml RPMI/20% FCS was 
added to each well. The cells were left at 37°C for an additional 10-14 days. The 
plates were checked frequently for contamination and treated as above. 10 days after 
addition of the RPMI/20% FCS, nearly every well of the OL-1 -transduced cells 

2 0 contained live colonies of cells, confirmed by the orange color of the nutrient-depleted 

media. The media in all wells of lacZ-transduced cells remained red, and little cell 
growth was observed in any of the wells. Selected wells of the OL-2 library-transduced 
cells contained live cells and nutrient-depleted media. During the next two weeks, cells 
were removed from all wells in which significant cell growth was occurring, as guaged 
25 by observing the cells directly under the microscope and monitoring the increasing 

nutrient depletion of the cell media. Cells from each well were resuspended in 5 ml 
fresh RPMI/10% FCS and placed in a 60 mm dish at 37 oC for 2-3 days. 

RNA isQfction 

RNA was isolated from the each surviving well population of OL-2 library-transduced 

3 0 Jurkat cells (17 wells) , as well from five surviving well populations of OL-1 - transduced 

cells, using the mRNA Capture Kit according to the manufacturer's protocol 
(Boehringer-Mannheimcat#1 787 896). Briefly: 
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5 x 10 5 cells from each dish were pelleted at 1400 rpm for five min in an Eppendorf 
tube, washed twice with PBS and resuspended in 200 ml lysis buffer and sheared by 
passing six times through a 21 guage needle attached to a 1 ml syringe. 4 ml 1:20 
dilution of biotinylated oligo(dT)20 was added to each sample and incubated for 3 min 
5 at 37°C. The mix was removed from each tube. Each tube was washed three times 

with 200 ml of washing buffer. Cells were also stored in 90%FCS/10%DMSO in 1 ml 
aliquots of 1 x 106 cells each in liquid nitrogen. 

RT PCR rescue of peptide-encodino inserts from selected cells. 
PCR was carried out using the TitanTM RT-PCR System (Boehringer Mannheim cat 
#1 855 476), using two primers: 5'pBL primer has the sequence: 5*-GAT CCT CCC 
TTT ATC CAG-3' and is complementary to nucleotides 1364-1381 of all pMSCVpc- 
based vectors and retrovirus mRNA, just upstream of the cloned insert. 3A primer has 
the sequence 5-CTA CAG GTG GGG TCT TTC-3' and is complementary to a 
sequence in all pMSCVpc-based vectors and retrovirus mRNA, just downstream of the 
cloned insert. 

Re-cloninc rescued p eotide-encodina inserts. 

Each PCR-rescued sample was extracted with phenol chloroform, ethanol precipitated 
and resupsended in 25 ml 10mM Tris pH 8.5. 3 ml nondenaturing DNA gel loading dye 
was added to 10 ml of each sample and run on a 10% acrylamide minigel with 
2 0 oligonucleotide quantitation standards and a 10 base pair ladder, as described above. 

Each lane contained one prominent band with the expected molecular weight of 216 
base pairs and minor background bands. The molarity of each sample was quantitated 
using NIH Image as before. Each sample was BstXI restriction digested, phenol 
extracted, ethanol precipitated and resuspended in 25 ml 10mM tris pH 8.5.The 

2 5 purified samples were loaded onto 10% acrylamide gels and quantitated as before. All 

samples contained a prominent band of 55 base pairs, the expected molecular weight 
for the restriction digested insert, as well as bands of 100 base pairs and 51 base pairs 
corresponding to each of the ends of the rescued DNA insert removed by the 
restriction enzyme. Each restriction digested, PCR-rescued insert was ligated at a 4:1 

3 0 insert: vector molar ratio with 100 ng pMSCVpc vector DNA, precipitated and 

electrotransformed as before. Surviving colonies for each transformation were PCR 
screened using the 5'pBL and 3A primers. 8 to 10 insert-containing colonies for each 
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transformation were cultured overnight, the cultures were pooled and a single mini- 
DNA preparation carried out for each pool. 

Fas-Selected Peptide clones: All peptides have the sequence: MET GLY LYS ARG 
LYS GLY XXX XXX XXX D/E XXX VAL GLY PRO PRO. Only the xxx xxx xxx D/E xxx 
5 amino acids are written above each DNA sequence below. 

From first library selection well: 

L1B3 INDIVIDUAL CLONES, FAS - SELECTED . 

THR ALA SER ASP ALA 
L1B3E1 ATG GGC AAG AGA AAG GGC ACG GCG TCT GAT GCT 
10 GTG GGG CCC CCT TAA 

TYR PRO SER ASP VAL 
L1B3E2 ATG GGC AAG AGA AAG GGC TAT CCT TCT GAT GTG 
GTG GGG CCC CCT TAA 

THR PRO SER ASP MET 

15 L1B3E3 ATG GGC AAG AGA AAG GGC ACG CCT TCG GAT ATG 

GTG GGG CCC CCT TAA 

THR ALA SER ASP LEU 
L1B3E6 ATG GGC AAG AGA AAG GGC ACG GCT TCT GAT CTT 
GTG GGG CCC CCT TAA 



20 



SER ASP ARG ASP ILE 
L1B3E7 ATG GGC AAG AGA AAG GGC TCT GAT AGG GAT ATT 
GTG GGG CCC CCT TAA 
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From second library selection well: 
L2A5 INDIVIDUAL CLONES, FAS SELECTED . 

TRP LEU LEU GLU PHE 
L2A5A2 ATG GGC AAG AGA AAG GGC TGG TTG CTA GAG TTT 
5 GTG GGG CCC CCT TAA 

TRP LEU LEU GLU PHE 
L2A5A3 ATG GGC AAG AGA AAG GGC TGG TTG ATA GAG TTT 
GTG GGG CCC CCT TAA 

TRP LEU LEU GLU PHE 
10 L2A5A6 ATG GGC AAG AGA AAG GGC TGG TTG CTA GAG TTT 

GTG GGG CCC CCT TAA 

TRP LEU LEU GLU PHE 
L2A5A8 ATG GGC AAG AGA AAG GGC TGG TTG CTA GAG TTT 
GTG GGG CCC CCT TAA 

15 SER TYR GLN ASP LEU 

L2A5A9 ATG GGC AAG AGA AAA GGC TCT TAC CAA GAT CTG 
GTG GGG CCC CCT TAA 

Example 3 

Staurosporine selection of NIH 3T3 cells transduced with pBabe puro peptide library 

2 0 A. Library construction. Construction of the pBabe puro random peptide library was 

described earlier in the patent. The randomized peptide has the sequence: 
MGXXXXXXXXXXGGPP. The diversity of the library is 2 x 1 0 8 at the DNA insert level. 
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B. Library transfection. Transfections were carried out as described for Fas-selection, 
but in 15 cm plates of 10 7 Phoenix cells. The DNA solution added to each plate 
consisted of: 50 ug library DNA, 5 ug lacZ vector, 4340 ul ddH 2 O t 610 ul 2M CaCI 2 and 
5000 ul 2xHBS. 



5 C. Library transduction. 24 hours prior to transduction, 2 x 10 7 NIH 3T3 cells were 

plated in each of ten 15 cm plates in 25 ml DMEM, 10% Bovine Calf Serum. 5 ml 
library virus supernatant was added to each plate (plus polybrene as before). 24 hour 
after transduction, media was changed to 25 ml fresh DMEM, 10% BCS. Cells were 
stained with X-gal at 48 hours post-transduction. The transduction efficiency was 
1 0 estimated as 40-50%. 



D. Staurosporine selection. Staurosporine, an alkaloid from Streptomyces sp., is a 
potent, broad spectrum inhibitor of protein kinases which binds the ATP site. Addition 
of 1 uM staurosporine in serum-free media to NIH 3T3 cells induced >99% apoptosis 
within 24 hours, as determined by ethidium bromide/achdine orange double staining as 
1 5 described for the Fas selection. 



2 x 10 6 library-transduced NIH 3T3 cells were plated in each of 10 15 cm plates. Cells 
wer allowed to attach for 24 hours, at which time staursporine was added to 1 uM in 
serum free DMEM. LacZ-transduced NIH 3T3 cells and BCL-2-transduced NIH 3T3 
cells were used as negative and positive controls, rfespectively. 24 hours after 

20 stuarosporine treatment, the media was changed to 25 ml fresh DMEM, 10% BCS. 

The media was changed every two days for one week, until the sruviving cells looked 
healthy (typical 3T3 morphology), at which time 1 uM staurosporine in serum-free 
media was added again. The media wash changed to DMEM, 10% BCS as before. 
Stp treatment was carried out again for a total of three treatments, at which time the 

2 5 number of library-transduced cells suriving appeared greater than the number of lacZ- 

transduced cells (but less than the BCL-2-transduced cells. 



E Moloney transfer. After the second staurosporine treatment, aliquots of surviving 
cells from each plate were infected with wild type Moloney murine leukemia virus 
supernatant, (generated by transfecting Phoenix cells with the retroviral vector pZap). 
3 0 The virus was allowed to spread through the culture for one week (with re-plating of 
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the cells every 2-3 days). Cells were plated as before and treated with Staurosporine 
before proceeding to RNA isolation and PCR rescue. 

F. RNA isolation. Aliquots of cells surviving in each plate were resuspended in 90% 
FCS, 10% DMSO and stored in liquid nitrogen. RNA was prepared with Trizol reagent 

5 (Gibco BRL, cat # 15596-026). Briefly, 1 ml TRIzol reagent was added were 10cm 2 

monolayer of cells and incubated for 5 min at room temperature. Cell lysates were 
transferred to 15 ml conical tubes. (Note: at this point, DEPC-treated solutions and 
glassware were used exclusively). 0.2 ml chloroform was added per 1 ml TRIzol 
reagent used. Tubes were shaken for 15 sec, incubated for 3 min at room temperature 

10 and centrifuged at 12000 x g for 15 min at 4 °C. The RNA-containing upper aqueuous 

phase was removed and 0.5 mi isopropanol added per 1 ml TRIzol used for the initial 
homogenization. Samples were mixed and incubated at room temperature for 10 min 
followed by centrifugation as before, the supernatant was removed and the RNA pellet 
washed with 75% ethanol (1 ml per 1 ml TRIzol). The sample was vortexed and 

15 centrifuged at 7500 x g for 5 min at 4 °C. The RNA pellet was air-dried for 10 min and 

resusupended in RNase-free water with 10 min incubation at 60 °C to dissolve the 
pellet. RNA concentration was determined by measuring the absorbance at 260 nm. 

G. PCR rescue PCR rescue was carried out as for Fas selection, using the primers 
5'pBL and SV 40 down. The second primer has the sequence: 5' CTG ACA CAC ATT 

2 0 CCA CAG 3* and is complimentary to positions 1424-1441 of the pBabe Puro retroviral 

vector. PCR reactions were extracted with phenol-chloroform, precipitated with 
ethanol and digested with Bam HI and Sal I before ligation with the retroviral vector 
pWZL neo. The figure shows a 10% acrylamide gel of representative PCR-generated 
inserts: 

25 Lane 1: 10 base pair ladder 

Lane 2: undigested PCR insert from Stuarosporine-selected cell population 

Lane 3: undigested PCR insert from same cell population, after Moloney rescue and 

Staurosporine selection. 

Lanes 4 and 5: same as lanes 2 and 3, after restriction digestion. 



H. Secon dary screen. pWZI neo vectors containing rescued inserts were transfected 
into Phoenix cells, and the resulting virus used to transduce NIH 3T3 cells. 
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Staurosporine selection was repeated three times as before, before RNA preparation 
and PCR rescue. 

I Sequences o f the first 9 positives: 

The sequences of the first nine positives are as follows: 

5 SEQUENCE OF 2 P 1 

GGATCCAGTGTGGTGGTACGTAGGAATACC - 

ATG GGA TGT CCG TCT GTT GCT AGG CCG CGG GGT GGT GGG GGC CCC CCC 
Met Gly Cys Pro Ser Val Ala Arg Pro Arg Gly Gly Gly Gly Pro Pro 
TAGCTAACTAAAGATCCCAGTGTGGTGGTACGTAGGAATTCGCC 2 PI 
10 Stp Stp Stp Bam /Bg 

GGATCCCAGTGTGGTGGTACGTAGGAATACC - 

ATG GGA TTG TCT TTT GTT ATT (C/TGT CTG CAG CAT CGT GGG GGC CCC 
Met Gly Leu Ser Phe Val lie Arg Leu Gin His Arg Gly Gly Pro 
15 CCC TAG CTAACTAAAGATCCCAGTGTGGTGGTACGT 4 PI 

Pro Stp Stp StpBam /Bg 
Cys 

SEQUENCE QF 5 P 1 

GG AT C C C AGTGTGGTGGTACGT AGGAG T AC C - 
2 0 ATG GGA CCT CCG ATT TGG TAT ACT CAT TGG AGT CAT GGG GGC CCC CCC 

Met Gly Pro Pro He Trp Tyr Thr His Trp Ser His Gly Gly Pro Pro 
TAG CTAACTAAAGAT CC 5P1 
Stp Stp StpBam /Bg 

SEQUENCE QF 6 P 1 
2 5 GG ATCCCAGTGTGGTGGT ACGTAGGAGTAC C - 

ATG GAA GTC AGG CGT TTG TGA ATA CTC GGC ATA AG GGG GGC CCC CCC 
Met Glu Val Arg Arg Leu Stp Gly Gly Pro Pro 

TAGCTAACTAAAGAT CC 6 PI 
Stp Stp StpBam /Bg 
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SEQUENCE OF 7 P 1 

CCGGCCGTATTCAACAAGGGGCTGAAGGATGCCCAGAAGGTACCCCATTGTATGGGATCTGAT 

CTGGGGCCTCGGTGCACATGCTTTACATGTGTTTAGTCGAGGTTAAAAAACGTCTAGGCCCCC 
7P1 

5 SEQUENCE OF 8 P 1 

GGATCCCAGTGTGGTGGTACGTAGGAATACC 

ATG GGA CTT TAG CCG GGC CCC CCCTAGCTAACTAAAGATCCCAGTGTGGTGGT 
Met Gly Leu Stp Pro Pro Stp Stp Stp Bam /Bg 

ACGTAGGAATTCGCCAGCACAG T 8 PI 

SEQUENCE OF 9 P 1 

GGATCCCAGTGTGGTGGTACGTAGGAATAC 

ATG GGA ACT GTT ATG GCG ATG TCG GAT TAG GTC GAG GGG GGC CCC CCC 
Met Gly Thr Val Met Ala Met Ser Asp Stp Gly Gly Pro Pro 

TAGCTAACTAAAGATCC 9P1 
Stp Stp Stp Bam /Bg 

SEQUENCE OF 10 P 1 
GGATCCAGTGTGGTGGTACGTAGGAATACC 

ATG GGA TGT CCG TCT GTT GCT AGG CCG CGG GGT GGT GGG GGC CCC CCC 
Met Gly Cys Pro Ser Val Ala Arg Pro Arg Gly Gly Gly Gly Pro Pro 
2 0 TAGCTAACTAAAGATCC 10 PI 

Stp Stp Stp Bam /Bg 

Example 4 
Use of NF-kB and NFAT in Signalling 

The NFKB/lkB complex is the classic pro-inflammatory second messenger system, 
known to be involved as a positive regulator of a number of pro-inflammatory 
processes and cytokines. These include, but are not limited to, IL-1, IL-6, IL-8, and 
TNF-a. As well, anti-inflammatory interleukins, such as IL-4, can lead to direct down- 
modulation of NF-kB in synovial fibroblasts and concommitant downregulation of IL-6 
production. The NF-kB/lkB complex is a widespread, acute-phase, rapid-response 
transcriptional activation system. It operates in most cell types tested, but leads to 
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different outcomes dependent upon the cell type and the nature of the initiating 
stimulus. Activators of NF-kB include LPS, TNF-a, IL-1 , inducers of T cell activation, 
protein synthesis inhibitors, phorbol esters, and a-lgM. Other inducers include the 
viruses Adenovirus, HTLV i, cytomegalovirus, Sendai, and Herpes simplex I, agents 
5 that cause cellular damage such as ultraviolet light and peroxides, and phosphatase 

inhibitors such as okadaic acid. These inducers act through PKA and PKC-dependent 
pathways, double-strand RNA-dependent kinase, and other pathways. Pharmacologic 
regulators of NF-kB, such as salicylate and glucocorticoids, act by either preventing 
IkB-a degradation or lead to upregulation of IkB-a transcription and steady-state levels, 
1 0 thereby acting to prevent the activation of this critical factor. 

NF-kB (Nuclear Factor that binds to the k locus B site) is present in the cytoplasm of 
most cells in an inactive form complexed to IkB (Inhibitor of NF-kB). Certain stimuli 
received by cells are processed by cellular signaling mechanisms and integrated in the 
specific phosphorylation of IkB and its degradation. The regulation of IkB-a function is 
1 5 through a Signal Response Element (SRE) in the amino terminus of the molecule. 

Phosphorylation of serine residues 32 and 36 leads to recognition of the IkB-a 
molecule by the ubiquitination machinery, release of NF-kB to the nucleus, and 
degradation of IkB. Therefore, dependent upon the phosphorylation/degradative state 
of IkB, NF-kB is either maintained in the cytoplasm or released to the nucleus. In the 

2 0 nucleus NF-kB binds to a consensus DNA motif found near the regulatory regions of 

many characterized genes and therein acts as a transcriptional regulator. Importantly, 
from the point of view of infectious disease, NF-kB is a primary activator of the Human 
Immunodeficiency Virus (HIV). Suitable induced genes include TNF-a and 1L-6. 

Biochemically, NF-kB is defined as a heterodimer of two polypeptides, p50 and p65, of 
25 corresponding molecular mass 50 and 65 kD. respectively. p50 is processed from a 

105 kD precursor protein by an as yet uncharacterized mechanism. p65 is the 
receptor for IkB and is the molecule through which IkB exerts its inhibitory/regulatory 
effects on NF-kB. These are the prototypic trasncription factors that define a large 
family of classical Rel/NF-kB factors. 

3 0 Cloning of the p50 and p65 components of NF-kB led to the discovery of a family of 

related factors, termed Rel. Both p50 and p65 have a 300 amino motif (Rel) at their 
amino termini that was originally described in the proto-oncogene c-rel and the 
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Drosophila axis-determining gene, Dorsal. The family of polypeptides revealed by p50 
and p65 have overlapping DNA-binding specificities, differential tissue distribution, and 
complex regulatory phenomena. p105(p50) is representative of the ankyrin-motif- 
containing Rel proteins that are processed in the cytoplasm to smaller proteins lacking 
5 the carboxyl terminus. The carboxyl terminus of p105 shows structural and functional 

homologies to IkB (which also has ankyrin motifs) and functions with an IkB-iike activity 
both in cis and in trans. p65 is representative of a second group of Rel proteins that 
have divergent carboxyl termini - these regions have been suggested to encode 
transcriptional activation domains. The 300 amino acids of Rel domains manifest four 
10 important functions: 1) DNA-binding in the roughly amino-terminal 1/3 of the domain, 2) 

dimerization in the carboxyl portion of the domain, 3) interaction with ankyrin- 
containing IkB-like proteins, and 4) nuclear-localizing signal at the carboxyl terminus of 
the Rel domain. In p50 the Rel domain also includes a transcriptional activation 
domain. 

NFAT, the Nuclear Factor of Activated T Cells (NFAT), is the immediate early acute 
phase response factor for T cell activation. Inhibition of NFAT by cyclosporin A (CsA) 
leads to blockade of IL-2 production and loss of T cell commitment to activation. 
NFAT, a critical component of pro-inflammatory events carried out by T cells, is also 
the factor blocked by CsA in transplantion. Upon cloning NFAT it was clear contains a 
region of the molecule implicated in DNA binding that has significant homology to the 
Rel family of proteins. Based on structural considerations, homology comparisons, 
and similar modes of action, as well as genomic structures of the molecules idiccate 
similar intron/exon boundaries in NF-kB and NFAT families, thus indicating that NFAT 
actually belongs to the Rel family of factors by lineal descent and that its interaction 
with pro-inflammatory trasncriptional regulators of the bZIP family would follow a 
general set of rules common to the NF-kB/bZIP interactions. 

We have shown that NFAT is involved in pro-inflammatory response to mitogens in 
activation of HIV-1 (S. Kinoshita and G.P.N, submitted) and that the binding of NFAT to 
sites overlapping the NF-kB sites of HIV-1 is reponsible for this process. This work 
3 0 follows on work by others showing that NFAT can regulate TNF-a activation in 

interaction with ATF-2/Jun and GM-CSF. Interestingly, NFAT also appears to be 
involved in regulation of mast cell release of IL-4, an important regulator of pro- 
inflammatory cytokines, such as IL-1p, TNF-a and IL-6. The activity of NFAT in these 
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systems has all shown to be pharmacologically modulated by CsA. Thus, although 
NFAT was originally discovered as a T cell specific factor, it was later found to be 
responsible for a host of immediate early, acute phase response activities, as well as 
direct regulation of IL-4. 

5 Therefore, the extended Rel families of NF-kB and NFAT make attractive targets for 

inhibition and modulation of pro-inflammatory action. Their involvement in numerous 
regulatory pathways and their decisive roles in such processes, including the specific 
interactions they elaborate with bZIP proteins, make them attractive specific targets for 
inhibition. 

10 Reporter Genes for detection of TNF-g and 1L-1 promoter activity, 

We designed a retrovirus-based luciferase reporter-gene system driven by a minimal 
promoter and two Igk NF-kB sites. In the constructs presented here, the deletions I 
introduced were more extensive than those previously published, since preliminary 
experiments showed that residual enhancer activity resided in commonly available 

15 deletion constructs (Nolan, Saksela and Baltimore, unpublished). The vectors 

designed were pSinll-luc (containing a luciferase gene in the retroviral sense 
orientation to test for residual promoter activity in the construct backbone), pSinll- 
fosluc (identical to pSinll luc except contains a minimal fos promoter element to test for 
residual enhancer activity in construct backbone), and pSin!l-2kBfostuc (derived from 

2 0 pSinll-fosluc with 2 Igk kB sites cloned 5' proximal to the fos minimal promoter as a 

reporter for NF-kB activity). These three vectors used to infect 1 x 10 6 70Z/3 cells. 
70Z/3 is a murine pre B cell line originally used in the initial characterization of NF-kB. 
After 48 hours, the infected cells were split into two fractions (stimulated with LPS and 
unstimulated). Six hours later, cell extracts were prepared and assayed for luciferase 
25 activity (extract representing ~10 4 cells was used for each point). The results showed 

that Sinll-luc showed no indiction, Sinil-fosluc showed roughly a one-fold increase, and 
Sinll-2kBfbsluc showed a four fold induction in lucerifase activity. Accordingly, 
retrovirally based reporter constructs can be used to sensitively report NF-kB activity in 
native chromatin. It now becomes possible to combine reporter gene technology with 

3 0 retroviral delivery of effector peptides. Unstimulated cells and stimulated controls 

(uninfected and Sinll-luc) showed little or no activity. Importantly, then, retroviral 
delivery did not result in significant background induction of NF-kB activity, a problem 
with other transfection procedures. The Sinil-luc and pSinll-fosluc controls shows no 
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significant residual promoter or enhancer activity in the construct. No significant 
readthrough from endogenous genomic loci or endogenous enhancer activity that 
might obscure readings was detected. These latter results are consistent with previous 
work using gene search retroviruses employing lacZ and flow cytometry. In these 
studies less than 0.1% of random integration events showed endogenous cis- 
regulation of the integrated constructs. 

These construct designs will be used as the basis for rapid creation and testing of 
TNF-a and IL-1 promoter studies in T cells, macrophages, and synovial cells. We will 
incorporate in the place of luciferase either the lacZ or GFP cDNAs for FACS-based 
assay. We will place up to three to four kilobases of TNF-a or IL-1 promoter region in 
place of the minimal promoter employed here. These constructs will be used as a 
proxy measure of endogenous TNF-a and IL-1 promoter activity and will serve to allow 
for searches for peptides from our libraries that act upon NF-kB or NFAT as well as 
unknown signaling pathways that are independent of NF-kB or NFAT critical to TNF-a 
and IL-1 signaling. 

The B cell lines to be used are 702/3. T cells to be used are human Jurkat. 
Macrophage lines to be used are Raw 309 and the P388D1 line which is highly 
responsive to PMA induction of secreted IL-1. Synovial cells to be used are HIG-82 
and can be activated with IL-1 to induce metalloproteases and with TNF-a to induce 
NF-kB. IL-1 induction of metalloproteases acts through NF-kB on collagenase and 
other metalloproteases of this group. Thus, we have shown that (J-gal fused to IkB-a 
and delivered via a retrovirus to cells responds to stimuli that degrade IkB-a as follows: 
a) 702/3 pre-B cells were infected with a retrovirus expressing a fusion of P-gal to 
either wild-type IkB-a or an inactive, dominant negative IkB-a; infection efficiency was 
approximately 30%. Cells were stimulated with LPS for varying times and then loaded 
with FDG for measure of b-gal expression by FACS. b) Cells from (a) were induced for 
maximal LPS induction of IkB-a degradation and treated with either salicylate or 
control. Salicylate blocked degradation of the P-gal-lkB fusion to the same extent as 
the dominant negative IkB-a. 

Direct detection in living cells of stead y state levels of IkB-a 

At the first approach, NF-kB activation will be measured using our newly developed 

IkB-a mobile reporter system described above. In this approach, the N-terminus of 
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IkB-a has been translationally fused to the lacZ gene. In mammalian ceils, P- 
galactosidase expression can be measured using the Fluorescence activated Cell 
Sorter (FACS) on a cell by cell basis. By coupling (J-gal to IkB-a, the stability of p-gal 
is functionally dependent upon IkB-a. Since signals in cells that activate NF-kB lead to 
5 the degradation of IkB, p-gal was similarly degraded; as above, cells were infected with 

a retrovirus containing a P-gal-lkB-a fusion and induced them with stimuli that lead to 
activation of NF-kB. We can use the cell sorter to distinguish cells that have degraded 
IkB-a on a REAL-TIME basis, and not through activation of proxy reporter genes. 
These lines were shown to respond accordingly after treatment with the anti- 
10 inflammatory agent salicylate (aspirin) which has been shown to be a direct inhibitor of 

NF-kB activation. We have used this and related protocols in B cells to select for novel 
mutants of IkB-a and have thereby defined new regions of the IkB-a molecule that 
respond to differential signaling (J. Caldwell and G. Nolan, unpublished). 

1x10 7 cells carrying the reporter wilt be infected at high efficiency with the molecular 
15 libraries described herein. Cells will be stimulated with LPS, TNF-a, IL-1 or PMA, and 

then used to select by FACS for those cells that DO NOT degrade p-gal. After growing 
out of the cells, the population will be restimulated as before and sorted again. Cells 
will be sorted until the population is 100% heritable for the lack of degradation 
phenotype . Inserts will be rescued, recloned into a retrovirus construct, and then 
2 0 screened again until a trans-phenotype can be confirmed. Peptides will be sequenced 

as noted. 

Selection NFAT-deficiencv u sing cell-death induction bv NFAT dependent pathways 
We have devised a system for selecting for blockade of NFAT signaling in cells that 
can be employed with our retroviral libraries. The system is based upon findings by 

2 5 Serafini and colleagues in which they were able to create a cell line whose death was 

dependent upon activation of NFAT. Cells stimulated by activators of T cells or NFAT 
lead to activation of NFAT and its translocation into the nucleus. Activation leads to 
induction of the diptheria toxin A gene such that the cells undergo rapid cell death. 
This is shown using Propidium iodide as a measure of cell viability. Thus, in a large 

3 0 population, those cells that are blocked for NFAT activation by peptides that interfere 

with the signaling system will survive. Serafini and colleagues used the approach to 
select for mutants in T cells signaling. We will use this proven NFAT-dipA systemin 
our peptide selections. 
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Again, cells will be infected as above with appropriate peptide libraries and screened 
for blockade of NFAT signaling. This basic approach, if successful, might be similarly 
applied to TNF-a or IL-1 signaling. 

There is expectation that signaling systems exist whose purpose is to provide either 
5 pro-inflammatory and anti-inflammatory signaling. As noted above, IL-4 for instance 

can blockade IL-6 signaling in cells. Induction of glucocorticoid expression leads to 
upregulation of IkB and thereby blocks NF-kB activation. Activation of anti-oxidant 
pathways is well known to be similarly anti-inflammatory. Salicylate blocks NF-kB 
through regulation of cellular oxygenase levels. Although the peptide searches 
10 outlined above might find players in such pathways intracellular^, we desire to search 

for surface molecules that might initiate such protective cascades. 

The peptide libraries in constructs for secreted peptides and tethered peptides will be 
used in T cell, macrophage, and B cell systems to select for blockade OR activation of 
NF-kB induction. Stimuli will include TNF-a and IL-1 for blockade. Activation will utilize 

1 5 the FACS-based systems in"reverse w . That is, we will look for peptides whose 

expression leads to constitutive activation of and NF-kB reporter construct. In this 
case the reporter construct can be a TNF-a reporter driving lacZ or GFP. The 
construct can similarly be IL-1 driving lacZ or GFP. For endogenous loci, we can 
select for cells that induce VCAM or ICAM-1 expression after IL-1 signaling by FACS, 

2 0 both known to be pro-inflammatory responders. Again, both positive AND negative 

selection can be employed. For cells expressing tethered peptides, the selection is 
straightforward as the intracellular peptides above. Post-definition of the peptide 
sequence, it will be necessary to synthesize the peptide without the tether synthetically 
and determine if the peptide can workin the absence of the tether. 

2 5 For secreted peptides the setup is more difficult as the responder cell must display the 

phenotype and we must trace the peptide back to the SECRETING cell. For this 
approach we can use any reporter gene or endogenous gene in the target cells as the 
readout. The cells to be Infected and which will secrete the peptides will be NIH3T3. 1 
x 10 7 3T3 cells will be infected with a fully representative library as outlined above. 

3 0 Cells post-infection will be allowed to form colonies of up to 1 0-20 cells. At this point 

media will be removed and the cells will be overlayed with a thin layer of 0.25% agar in 
media. Once solidified, a thin, porous membrane will be placed over the cells, and we 
will then overlay on this plate the responder cells at high density, also in 0.3% agar. 
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Plates and membranes will be marked with indigo black. In this way secreted product 
can diffuse to the responder cells. For selection of PRO-inflammatory secreted 
peptides, after 48 hours responder cells will be lifted from the plate on the membrane 
and the membrane/cells/agar will be flipped onto a correspondingly sized nitro- 
5 cellulose membrane. Cells will be lysed in situ by Sarcosyl or other appropriate 

detergent and then applied on the membrane to a high-salt solution and suction below 
the nitrocellulose. In this way cellular proteins will leach out of the agar matrix and bind 
to the nitrocellulose. The nitrocellulose can then be treated like a "Western" for 
induction or blockade of any of a number of different cellular proteins. In initial tests we 

1 o will use reporter genes driving enzymes such as b-gal or alkaline phosphatase to 

ensure assay sensitivity. As we perfect the assay it should be possible to set up direct 
measures of certain endogenous loci (such as TNF-a, NF-kB p65, etc.). Once cell 
areas on the membrane are noted, they can be traced back to the secretion cells by 
the indigo marking of the plates and alignment. NIH-3T3 cell "patches" corresponding 
15 to the appropriate area can be picked, expanded, and retested. As a positive control, 

viruses expressing TNF-a or IL-1 will be used in initial scaled mock-ups to calibrate the 
sensitivity of the search for pro-inflammatory peptides. 

Similarly, one can search for blockade of pro-inflammatory signaling. In this case, at 
24 to 36 hours post plating of the reponder cells, we will add a pro-inflammatory 

2 0 cytokine such as IL-1 or TNF-a to the agar layers in a liquid overlaying the 

agar/responders. The plate is now, from bottom to top: Secretor 
Cells/Membrane/Responder Cells/Liquid Overlay. The pro-inflammatory inducer will 
diffuse into the reposnder cells layer rapidly. Those cells that have been "protected" 
from pro-inflammatory events by a localized presence of an anti-inflammatory secreted 
25 peptide will not respond to the stimuli. As above, these can be detected against a 

background of responders by a nitrocellose assay for enzymatic activity. The latter, 
that is looking for "holes" against a background of positivity on the nitrocellose can be 
used to screen for inhibitors of pro-inflammatory events. As a positive control, viruses 
expressing IL-4 will be used in initial mock-ups to calibrate the sensitivity of the search 

3 0 for anti-inflammatory peptides. 
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CLAIMS 

I claim: 

1. A method for screening for a transdominant intracellular bioactive agent capable of 
altering the phenotype of a cell, said method comprising the steps: 

a) introducing a molecular library of randomized candidate nucleic acids into a plurality 
of cells, wherein each of said nucleic acids comprises a different nucleotide sequence; 

b) screening said plurality of cells for a cell exhibiting an altered phenotype, wherein 
said altered phenotype is due to the presence of a transdominant bioactive agent. 

2. A method according to claim 1 further comprising the step: 

c) isolating said cell exhibiting an altered phenotype. 

3. A method according to claim 2 further comprising the step: 

d) isolating a candidate nucleic acid from said cell. 

4. A method according to claim 2 or 3 further comprising the step: 

e) isolating a target molecule using 

i) a candidate nucleic acid; or 

ii) the expression product of a candidate nucleic acid. 

5. A method according to claim 1 wherein said randomized candidate nucleic acids 
are expressed in said cells to produce a plurality of randomized candidate expression 
products. 

6. A method according to claim 5 wherein said randomized candidate expression 
products are peptides. 

7. A method according to claim 5 wherein said randomized candidate expression 
products are nucleic acid transcripts. 

8. A method according to claim 1 wherein said nucleic acids further comprise a 
presentation sequence capable of presenting said expression product in a 
conformationally restricted form. 
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9. A method according to claim 1 wherein said introducing is with retroviral vectors. 

10. A method according to claim 1 wherein said cells are mammalian cells. 

11. A method according to claim 1 wherein said library comprises at least 10 4 different 
nucleic acids. 

5 12. A method according to claim 1 wherein said library comprises at least 10 5 different 

nucleic acids. 

13. A method according to claim 1 wherein said library comprises at least 10 6 different 
nucleic acids. 

14. A method according to claim 1 wherein said library comprises at least 10 7 different 
10 nucleic acids. 

15. A method according to claim 1 wherein said library comprises at least 10 8 different 
nucleic acids. 

16. A molecular library of retroviruses comprising at least 10 4 different randomized 
nucleic acids. 

15 17. A molecular library of retroviruses according to claim 21 comprising at least 10 5 

different randomized nucleic acids. 

18. A molecular library of retroviruses according to claim 21 comprising at least 10 6 
different randomized nucleic acids. 

2 0 1 9. A molecular library of retroviruses according to claim 21 comprising at least 10 7 

different randomized nucleic acids. 

20. A molecular library of retroviruses according to claim 21 comprising at least 10 8 
different randomized nucleic acids. 
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21 . A cellular library of mammalian cells containing a molecular library of retroviral 
constructs, said molecular library comprising at least 10 4 different randomized nucleic 
acids. 

22. A cellular library according to claim 26 wherein said constructs are integrated into 
5 the cellular genome. 
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